Embodied AI is a recent research area that aims at creating intelligent agents that can move and operate inside an environment. Existing approaches in this field demand the agents to act in completely new and unexplored scenes. However, this setting is far from realistic use cases that instead require executing multiple tasks in the same environment. Even if the environment changes over time, the agent could still count on its global knowledge about the scene while trying to adapt its internal representation to the current state of the environment. To make a step towards this setting, we propose Spot the Difference: a novel task for Embodied AI where the agent has access to an outdated map of the environment and needs to recover the correct layout in a fixed time budget. To this end, we collect a new dataset of occupancy maps starting from existing datasets of 3D spaces and generating a number of possible layouts for a single environment. This dataset can be employed in the popular Habitat simulator and is fully compliant with existing methods that employ reconstructed occupancy maps during navigation. Furthermore, we propose an exploration policy that can take advantage of previous knowledge of the environment and identify changes in the scene faster and more effectively than existing agents. Experimental results show that the proposed architecture outperforms existing state-of-the-art models for exploration on this new setting.
翻译:嵌入的AI是最近一个研究领域,目的是创造智能剂,在环境中移动和操作。该领域的现有方法要求代理人在完全新的和未探索的场景中采取行动。然而,这一环境远非现实地使用需要在同一环境中执行多重任务的案例。即使环境随着时间的变化而变化,代理人仍然可以依靠其对现场的全球知识,同时努力使其内部代表结构适应当前环境状况。为了朝着这一环境方向迈出一步,我们提议“点点点点点点点点点”:Embodied AI的一项新任务,该代理人可以在那里获得过时的环境地图,需要在一个固定的时间预算中恢复正确的布局。为此,我们从现有的3D空间数据集开始,收集新的占用地图数据集,为单一环境生成若干可能的布局。该数据集可以用于受欢迎的生境模拟器,并且完全符合在导航过程中使用重建的占用地图的现有方法。此外,我们提议了一项探索政策,可以利用以前对环境的了解,并查明在固定预算时间内恢复正确布局的布局。为此,我们从现有的3D空间空间的数据集开始,为单一环境生成若干可能的布局。这个数据集可以完全用于在流行的模拟中,并且完全符合在导航过程中使用现有的地图。我们提出能够利用以前对环境的了解,在新的建筑外更快速、更精确地显示新的构造。实验结果。