In many real-world scenarios where extrinsic rewards to the agent are extremely sparse, curiosity has emerged as a useful concept providing intrinsic rewards that enable the agent to explore its environment and acquire information to achieve its goals. Despite their strong performance on many sparse-reward tasks, existing curiosity approaches rely on an overly holistic view of state transitions, and do not allow for a structured understanding of specific aspects of the environment. In this paper, we formulate curiosity based on grounded question answering by encouraging the agent to ask questions about the environment and be curious when the answers to these questions change. We show that natural language questions encourage the agent to uncover specific knowledge about their environment such as the physical properties of objects as well as their spatial relationships with other objects, which serve as valuable curiosity rewards to solve sparse-reward tasks more efficiently.
翻译:在许多真实的情景中,对代理人的外部奖励极为稀少,好奇心已成为一个有用的概念,它提供了内在奖励,使代理人能够探索其环境并获取信息以实现其目标。尽管在很多微薄的奖励任务上表现强劲,但现有的好奇心方法依赖于对国家转型的过分全面的看法,无法对环境的具体方面有条理的理解。在本文中,我们根据基于问题的回答提出好奇心,鼓励代理人询问环境问题,并在这些问题的答案发生变化时好奇。我们表明,自然语言问题鼓励代理人发现有关其环境的具体知识,例如物体的物理特性以及它们与其他物体的空间关系,这些是宝贵的好奇心回报,可以更有效地解决稀薄的奖励任务。