有效性乃必需之物 (Validity Is What You Need)

While AI agents have long been discussed and studied in computer science, today's Agentic AI systems are something new. We consider other definitions of Agentic AI and propose a new realist definition. Agentic AI is a software delivery mechanism, comparable to software as a service (SaaS), which puts an application to work autonomously in a complex enterprise setting. Recent advances in large language models (LLMs) as foundation models have driven excitement in Agentic AI. We note, however, that Agentic AI systems are primarily applications, not foundations, and so their success depends on validation by end users and principal stakeholders. The tools and techniques needed by the principal users to validate their applications are quite different from the tools and techniques used to evaluate foundation models. Ironically, with good validation measures in place, in many cases the foundation models can be replaced with much simpler, faster, and more interpretable models that handle core logic. When it comes to Agentic AI, validity is what you need. LLMs are one option that might achieve it.

翻译：尽管人工智能代理在计算机科学领域已被长期探讨与研究，但如今的智能体人工智能系统呈现出全新形态。我们审视了其他关于智能体人工智能的定义，并提出一种新的现实主义定义：智能体人工智能是一种软件交付机制，类似于软件即服务（SaaS），其使应用程序能够在复杂的企业环境中自主运行。近期大型语言模型（LLMs）作为基础模型取得的进展，推动了智能体人工智能领域的蓬勃发展。然而我们注意到，智能体人工智能系统本质上是应用而非基础架构，因此其成功取决于终端用户及主要利益相关者的验证。主要用户验证应用程序所需的工具和技术，与评估基础模型所用的工具技术存在显著差异。具有讽刺意味的是，当建立完善的验证机制后，许多情况下基础模型可被更简洁、快速、可解释的核心逻辑处理模型所替代。对于智能体人工智能而言，有效性才是根本需求，而大型语言模型仅是可能实现该目标的选项之一。

相关内容

关注 7076

人工智能杂志AI(Artificial Intelligence)是目前公认的发表该领域最新研究成果的主要国际论坛。该期刊欢迎有关AI广泛方面的论文，这些论文构成了整个领域的进步，也欢迎介绍人工智能应用的论文，但重点应该放在新的和新颖的人工智能方法如何提高应用领域的性能，而不是介绍传统人工智能方法的另一个应用。关于应用的论文应该描述一个原则性的解决方案，强调其新颖性，并对正在开发的人工智能技术进行深入的评估。官网地址：http://dblp.uni-trier.de/db/journals/ai/

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

专知会员服务

34+阅读 · 2019年10月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日