Large language model based multi-agent systems (MAS) have unlocked significant advancements in tackling complex problems, but their increasing capability introduces a structural fragility that makes them difficult to debug. A key obstacle to improving their reliability is the severe scarcity of large-scale, diverse datasets for error attribution, as existing resources rely on costly and unscalable manual annotation. To address this bottleneck, we introduce Aegis, a novel framework for Automated error generation and attribution for multi-agent systems. Aegis constructs a large dataset of 9,533 trajectories with annotated faulty agents and error modes, covering diverse MAS architectures and task domains. This is achieved using a LLM-based manipulator that can adaptively inject context-aware errors into successful execution trajectories. Leveraging fine-grained labels and the structured arrangement of positive-negative sample pairs, Aegis supports three different learning paradigms: Supervised Fine-Tuning, Reinforcement Learning, and Contrastive Learning. We develop learning methods for each paradigm. Comprehensive experiments show that trained models consistently achieve substantial improvements in error attribution. Notably, several of our fine-tuned LLMs demonstrate performance competitive with or superior to proprietary models an order of magnitude larger, validating our automated data generation framework as a crucial resource for developing more robust and interpretable multi-agent systems. Our project website is available at https://kfq20.github.io/Aegis-Website/.
翻译:基于大型语言模型的多智能体系统在解决复杂问题方面取得了显著进展,但其不断增强的能力也带来了结构性脆弱问题,使得系统调试变得困难。提升此类系统可靠性的主要障碍在于大规模多样化错误归因数据集的严重匮乏,现有资源依赖成本高昂且难以扩展的人工标注。为突破这一瓶颈,我们提出了Aegis——一个面向多智能体系统的自动化错误生成与归因新框架。Aegis构建了包含9,533条轨迹的大规模数据集,涵盖多样化多智能体架构与任务领域,每条轨迹均标注了故障智能体与错误模式。该数据集通过基于大型语言模型的操纵器实现,该操纵器能够将上下文感知错误自适应注入成功执行轨迹。借助细粒度标签与正负样本对的结构化编排,Aegis支持三种不同学习范式:监督微调、强化学习与对比学习。我们为每种范式开发了相应学习方法。综合实验表明,经训练的模型在错误归因任务上持续取得显著提升。值得注意的是,我们多个微调后的大型语言模型展现出与规模大一个数量级的专有模型相当或更优的性能,这验证了我们自动化数据生成框架作为开发更鲁棒、可解释多智能体系统关键资源的价值。项目网站详见 https://kfq20.github.io/Aegis-Website/。