The DARPA Lifelong Learning Machines (L2M) program seeks to yield advances in artificial intelligence (AI) systems so that they are capable of learning (and improving) continuously, leveraging data on one task to improve performance on another, and doing so in a computationally sustainable way. Performers on this program developed systems capable of performing a diverse range of functions, including autonomous driving, real-time strategy, and drone simulation. These systems featured a diverse range of characteristics (e.g., task structure, lifetime duration), and an immediate challenge faced by the program's testing and evaluation team was measuring system performance across these different settings. This document, developed in close collaboration with DARPA and the program performers, outlines a formalism for constructing and characterizing the performance of agents performing lifelong learning scenarios.
翻译:DARPA终身学习机(L2M)方案力求在人造情报系统(AI)方面取得进步,使其能够持续学习(和改进),利用一项任务的数据提高另一项任务的业绩,并以计算上可持续的方式这样做。该方案的表演者开发了能够履行各种功能的系统,包括自主驾驶、实时战略和无人机模拟。这些系统具有多种多样的特点(例如任务结构、寿命期限),该方案的测试和评价小组面临的一项直接挑战就是衡量不同环境的系统业绩。这份文件是与DARPA和方案执行者密切合作编写的,概述了构建和描述从事终身学习情景的代理机构的业绩的正规主义。