Object permanence in psychology means knowing that objects still exist even if they are no longer visible. It is a crucial concept for robots to operate autonomously in uncontrolled environments. Existing approaches learn object permanence from low-level perception, but perform poorly on more complex scenarios, like when objects are contained and carried by others. Knowledge about manipulation actions performed on an object prior to its disappearance allows us to reason about its location, e.g., that the object has been placed in a carrier. In this paper we argue that object permanence can be improved when the robot uses knowledge about executed actions and describe an approach to infer hidden object states from agent actions. We show that considering agent actions not only improves rule-based reasoning models but also purely neural approaches, showing its general applicability. Then, we conduct quantitative experiments on a snitch localization task using a dataset of 1,371 synthesized videos, where we compare the performance of different object permanence models with and without action annotations. We demonstrate that models with action annotations can significantly increase performance of both neural and rule-based approaches. Finally, we evaluate the usability of our approach in real-world applications by conducting qualitative experiments with two Universal Robots (UR5 and UR16e) in both lab and industrial settings. The robots complete benchmark tasks for a gearbox assembly and demonstrate the object permanence capabilities with real sensor data in an industrial environment.
翻译:心理学中的物体永久性意味着知道物体即使不再可见,也仍然存在。这是机器人在不受控制的环境中自主运行的关键概念。 现有方法从低层次的认知中学习物体永久性,但在更复杂的情景中表现不佳,例如物体被控制和携带。 有关在物体消失之前对物体进行的操纵行动的知识使我们能够了解物体的位置,例如,物体被放置在一个载体中。 在本文件中,我们争辩说,当机器人使用关于已执行行动的知识,并描述一种从代理人的行动推断隐藏物体状态的方法时,物体永久性是可以改进的。 我们表明,考虑代理人行动不仅改进基于规则的推理模型,而且纯粹的神经方法,显示其普遍适用性。 然后,我们利用1 371个合成视频数据集,对一个告密的本地化任务进行定量实验,我们把不同物体永久模型的性模型的性能与没有行动说明加以比较。 我们证明,有行动说明的模型可以大大提高神经和基于规则的方法的性能。 最后,我们评估我们的方法在现实世界中的实用性,我们不仅改进了基于规则的推算模型,而且用两个实验室级的实验室级标准, 也展示了一种实验室级的实验室级的模型。