While AI agents have long been discussed and studied in computer science, today's Agentic AI systems are something new. We consider other definitions of Agentic AI and propose a new realist definition. Agentic AI is a software delivery mechanism, comparable to software as a service (SaaS), which puts an application to work autonomously in a complex enterprise setting. Recent advances in large language models (LLMs) as foundation models have driven excitement in Agentic AI. We note, however, that Agentic AI systems are primarily applications, not foundations, and so their success depends on validation by end users and principal stakeholders. The tools and techniques needed by the principal users to validate their applications are quite different from the tools and techniques used to evaluate foundation models. Ironically, with good validation measures in place, in many cases the foundation models can be replaced with much simpler, faster, and more interpretable models that handle core logic. When it comes to Agentic AI, validity is what you need. LLMs are one option that might achieve it.
翻译:尽管人工智能代理在计算机科学领域已被长期探讨与研究,但如今的智能体人工智能系统呈现出全新形态。我们审视了其他关于智能体人工智能的定义,并提出一种新的现实主义定义:智能体人工智能是一种软件交付机制,类似于软件即服务(SaaS),其使应用程序能够在复杂的企业环境中自主运行。近期大型语言模型(LLMs)作为基础模型取得的进展,推动了智能体人工智能领域的蓬勃发展。然而我们注意到,智能体人工智能系统本质上是应用而非基础架构,因此其成功取决于终端用户及主要利益相关者的验证。主要用户验证应用程序所需的工具和技术,与评估基础模型所用的工具技术存在显著差异。具有讽刺意味的是,当建立完善的验证机制后,许多情况下基础模型可被更简洁、快速、可解释的核心逻辑处理模型所替代。对于智能体人工智能而言,有效性才是根本需求,而大型语言模型仅是可能实现该目标的选项之一。