Reasoning about potential occlusions is essential for robots to efficiently predict whether an object exists in an environment. Though existing work shows that a robot with active perception can achieve various tasks, it is still unclear if occlusion reasoning can be achieved. To answer this question, we introduce the task of robotic object existence prediction: when being asked about an object, a robot needs to move as few steps as possible around a table with randomly placed objects to predict whether the queried object exists. To address this problem, we propose a novel recurrent neural network model that can be jointly trained with supervised and reinforcement learning methods using a curriculum training strategy. Experimental results show that 1) both active perception and occlusion reasoning are necessary to successfully achieve the task; 2) the proposed model demonstrates a good occlusion reasoning ability by achieving a similar prediction accuracy to an exhaustive exploration baseline while requiring only about $10\%$ of the baseline's number of movement steps on average; and 3) the model generalizes to novel object combinations with a moderate loss of accuracy.
翻译:有关潜在隔离的考虑对于机器人有效预测某一物体在环境中是否存在至关重要。虽然现有工作表明,具有积极认知的机器人能够完成各种任务,但还不清楚能否实现隔离推理。为了回答这个问题,我们引入了机器人存在预测的任务:当被问及某一物体时,机器人需要尽可能少地围绕带有随机放置物体的表格移动,以预测是否存在被询问对象。为了解决这一问题,我们提议了一个新的经常性神经网络模型,可以使用课程培训战略,通过监督和强化学习方法共同培训。实验结果表明:(1) 成功完成这项任务需要积极认知和隔离推理两个方面;(2) 拟议模型显示良好的隔离推理能力,即实现与详尽探索基线相似的预测准确性,同时平均只需要大约10美元的基准移动步骤数量;(3) 模型一般化新物体组合,但有一定程度的准确性损失。