We are at the cusp of a transition from "learning from data" to "learning what data to learn from" as a central focus of artificial intelligence (AI) research. While the first-order learning problem is not completely solved, large models under unified architectures, such as transformers, have shifted the learning bottleneck from how to effectively train our models to how to effectively acquire and use task-relevant data. This problem, which we frame as exploration, is a universal aspect of learning in open-ended domains, such as the real world. Although the study of exploration in AI is largely limited to the field of reinforcement learning, we argue that exploration is essential to all learning systems, including supervised learning. We propose the problem of generalized exploration to conceptually unify exploration-driven learning between supervised learning and reinforcement learning, allowing us to highlight key similarities across learning settings and open research challenges. Importantly, generalized exploration serves as a necessary objective for maintaining open-ended learning processes, which in continually learning to discover and solve new problems, provides a promising path to more general intelligence.
翻译:我们正处在从“从数据中学习”到“学习什么数据从中学习”的过渡阶段,这是人工智能(AI)研究的中心焦点。虽然第一级学习问题尚未完全解决,但是在统一架构下的大型模型,例如变压器,已经将学习瓶颈从如何有效培训我们的模型转向如何有效获取和使用与任务相关的数据。我们作为探索框架的这一问题是开放领域学习的一个普遍方面,例如真实世界。尽管AI的探索研究主要局限于强化学习领域,但我们认为,探索对所有学习系统,包括监督学习,都至关重要。我们提出了在概念上统一受监督的学习与强化学习之间的探索驱动学习,从而使我们能够突出学习环境之间的关键相似之处和开放研究挑战。重要的是,普遍探索是保持开放学习进程的必要目标,在不断学习以发现和解决新问题的过程中,为更普遍的智慧提供了一条有希望的道路。