Large Language Model (LLM) agents have demonstrated remarkable capabilities in organizing and executing complex tasks, and many such agents are now widely used in various application scenarios. However, developing these agents requires carefully designed workflows, carefully crafted prompts, and iterative tuning, which requires LLM techniques and domain-specific expertise. These hand-crafted limitations hinder the scalability and cost-effectiveness of LLM agents across a wide range of industries. To address these challenges, we propose \textbf{InfiAgent}, a Pyramid-like DAG-based Multi-Agent Framework that can be applied to \textbf{infi}nite scenarios, which introduces several key innovations: a generalized "agent-as-a-tool" mechanism that automatically decomposes complex agents into hierarchical multi-agent systems; a dual-audit mechanism that ensures the quality and stability of task completion; an agent routing function that enables efficient task-agent matching; and an agent self-evolution mechanism that autonomously restructures the agent DAG based on new tasks, poor performance, or optimization opportunities. Furthermore, InfiAgent's atomic task design supports agent parallelism, significantly improving execution efficiency. This framework evolves into a versatile pyramid-like multi-agent system capable of solving a wide range of problems. Evaluations on multiple benchmarks demonstrate that InfiAgent achieves 9.9\% higher performance compared to ADAS (similar auto-generated agent framework), while a case study of the AI research assistant InfiHelper shows that it generates scientific papers that have received recognition from human reviewers at top-tier IEEE conferences.
翻译:大型语言模型(LLM)智能体在组织与执行复杂任务方面已展现出卓越能力,目前多种此类智能体已广泛应用于各类实际场景。然而,开发这类智能体需要精心设计的工作流程、细致构建的提示词以及迭代调优,这既要求掌握LLM技术又需具备领域专业知识。这些人工设计的局限性制约了LLM智能体在各行业规模化应用的成本效益。为应对这些挑战,我们提出\textbf{InfiAgent}——一种基于有向无环图(DAG)的金字塔式多智能体框架,可适用于\textbf{无限}场景。该框架引入多项关键创新:1)通用化的“智能体即工具”机制,可自动将复杂智能体分解为层次化多智能体系统;2)双重审核机制,保障任务完成的质量与稳定性;3)智能体路由功能,实现高效的任务-智能体匹配;4)智能体自演进机制,能依据新任务、性能瓶颈或优化机会自主重构智能体DAG。此外,InfiAgent的原子化任务设计支持智能体并行执行,显著提升运行效率。该框架最终演化为可解决广泛问题的通用金字塔式多智能体系统。在多基准测试上的评估表明,InfiAgent相比ADAS(同类自动生成智能体框架)性能提升9.9%;以AI科研助手InfiHelper的案例研究显示,其生成的学术论文已获得IEEE顶级会议评审专家的认可。