As more and more AI agents are used in practice, it is time to think about how to make these agents fully autonomous so that they can learn by themselves in a self-motivated and self-supervised manner rather than being retrained periodically on the initiation of human engineers using expanded training data. As the real-world is an open environment with unknowns or novelties, detecting novelties or unknowns, characterizing them, accommodating or adapting to them, gathering ground-truth training data, and incrementally learning the unknowns/novelties are critical to making the agent more and more knowledgeable and powerful over time. The key challenge is how to automate the process so that it is carried out on the agent's own initiative and through its own interactions with humans and the environment. Since an AI agent usually has a performance task, characterizing each novelty becomes critical and necessary so that the agent can formulate an appropriate response to adapt its behavior to accommodate the novelty and to learn from it to improve the agent's adaptation capability and task performance. The process goes continually without termination. This paper proposes a theoretic framework for this learning paradigm to promote the research of building Self-initiated Open world Learning (SOL) agents. An example SOL agent is also described.
翻译:由于在实践中越来越多地使用AI代理物,现在应该考虑如何使这些代理物完全自主,以便他们能够以自我激励和自我监督的方式自我学习,而不是在使用扩大的培训数据开始使用人类工程师时定期接受再培训。现实世界是一个开放的环境,具有未知或新颖性,发现新事物或新事物,将其定性,适应或适应它们,收集地面真实培训数据,并逐步了解未知事物/新事物,对于使该代理物随着时间的推移越来越了解和强大至关重要。关键的挑战是如何使该过程自动化,以便通过该代理物自己主动地并通过其自身与人类和环境的互动来进行。由于一个AI代理物通常有工作任务,因此每个新事物的特点变得至关重要和必要,以便该代理物能够制定适当的对策,调整其行为,以适应新事物,并从中学习如何提高该代理物的适应能力和任务性。这一过程将持续进行,而没有终止。本文件提出一个学习模式的理论框架,以促进建设自导世界代理物的研究。