Beyond Traditional AI Training: USTC Proposes Role-Agent, a Dual-Role Co-Evolution Mechanism

张建站

2026/6/12 1:44:53

10-minute read

Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution

Authors: Xucong Wang, Ziyu Ma, Shidong Yang, Tongwen Huang, Pengkun Wang, Yong Wang, Xiangxiang Chu (USTC AMAP, Alibaba) | Year: 2026 | arXiv: 2606.10917

2. Research Background

LLM agent learning is limited by two problems: (1) inefficient interaction feedback, since conventional reinforcement learning usually provides only a sparse final reward; and (2) static training environments, since the training data is fixed and the agent cannot practice specifically against its own failure modes.

Role-Agent's core insight is that the LLM itself holds enough world knowledge to simulate environment dynamics, and it can also analyze its own failures, so it can actively choose which tasks to practice.

4. Experimental Results

Role-Agent was evaluated on multiple agent benchmarks covering programming, navigation, and knowledge question answering. It improves on strong baselines by over 4% on average. WIA's process reward is especially effective on long-horizon tasks. AIW's failure-mode retrieval effectively concentrates practice on known weaknesses.

Report generated: 2026-06-11 | Paper source: arXiv:2606.10917

Original abstract: Although Large Language Model (LLM) agents have demonstrated strong performance on complex tasks, their learning is often limited by inefficient interaction feedback and static training environments, which hinder broader generalization. To address these limitations, this paper introduces Role-Agent, a framework that harnesses a single LLM to function concurrently as both the agent and the environment, enabling a bootstrapped co-evolution. Role-Agent comprises two synergistic components: World-In-Agent (WIA) and Agent-In-World (AIW). In WIA, the LLM acts as the agent and predicts future states after each action; the alignment between predicted and actual states is then used as a process reward, encouraging environment-aware reasoning. In AIW, the LLM analyzes failure modes from failed trajectories and retrieves tasks with similar failure patterns, thereby reshaping the training data distribution for targeted practice. Experiments on multiple benchmarks show that Role-Agent consistently improves performance, yielding an average gain of over 4% over strong baselines.

PDF link: https://arxiv.org/pdf/2606.10917v1
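The two components described in the abstract can be illustrated with a small Python sketch. This is not the paper's implementation. The abstract does not say how alignment between predicted and actual states is measured, so the token-overlap F1 used here is an assumed stand-in. In the paper the LLM itself diagnoses the failure mode, while this sketch takes the failure tag as given. All names (`token_f1`, `process_rewards`, `retrieve_similar_tasks`, the `failure_tags` field) are hypothetical.

```python
from collections import Counter


def token_f1(predicted: str, actual: str) -> float:
    """Token-overlap F1 between two state descriptions (assumed alignment metric)."""
    pred_tokens = predicted.lower().split()
    actual_tokens = actual.lower().split()
    if not pred_tokens or not actual_tokens:
        return 0.0
    # Multiset intersection counts each shared token at most as often as it appears in both.
    overlap = sum((Counter(pred_tokens) & Counter(actual_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(actual_tokens)
    return 2 * precision * recall / (precision + recall)


def process_rewards(predicted_states, actual_states):
    """WIA-style step rewards: score each predicted next state against the observed one."""
    return [token_f1(p, a) for p, a in zip(predicted_states, actual_states)]


def retrieve_similar_tasks(failure_tag, task_pool, k=2):
    """AIW-style retrieval: return up to k tasks tagged with the diagnosed failure mode."""
    matches = [task for task in task_pool if failure_tag in task["failure_tags"]]
    return matches[:k]


# Hypothetical trajectory: the agent predicts the next state after each action.
predicted = ["door is open", "agent holds key"]
actual = ["door is open", "agent holds nothing"]
rewards = process_rewards(predicted, actual)

# Hypothetical task pool with hand-written failure tags.
pool = [
    {"id": "t1", "failure_tags": {"wrong_tool"}},
    {"id": "t2", "failure_tags": {"looping"}},
    {"id": "t3", "failure_tags": {"wrong_tool", "looping"}},
]
practice = retrieve_similar_tasks("looping", pool)
print(rewards, [task["id"] for task in practice])
```

The per-step rewards give a denser signal than a single final reward, and the retrieved tasks show how a diagnosed failure mode can shift which training tasks are sampled next.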
