Embodied AI has shown promising results on an abundance of robotic tasks in simulation, including visual navigation and manipulation. The prior work generally pursues high success rates with shortest paths while largely ignoring the problems caused by collision during interaction. This lack of prioritization is understandable: in simulated environments there is no inherent cost to breaking virtual objects. As a result, well-trained agents frequently have catastrophic collision with objects despite final success. In the robotics community, where the cost of collision is large, collision avoidance is a long-standing and crucial topic to ensure that robots can be safely deployed in the real world. In this work, we take the first step towards collision/disturbance-free embodied AI agents for visual mobile manipulation, facilitating safe deployment in real robots. We develop a new disturbance-avoidance methodology at the heart of which is the auxiliary task of disturbance prediction. When combined with a disturbance penalty, our auxiliary task greatly enhances sample efficiency and final performance by knowledge distillation of disturbance into the agent. Our experiments on ManipulaTHOR show that, on testing scenes with novel objects, our method improves the success rate from 61.7% to 85.6% and the success rate without disturbance from 29.8% to 50.2% over the original baseline. Extensive ablation studies show the value of our pipelined approach. Project site is at https://sites.google.com/view/disturb-free
翻译:模拟中大量机器人任务(包括视觉导航和操纵)的模拟中,大赦国际已经展示了可喜的成果。先前的工作通常追求高成功率,使用最短的路径,而基本上忽视互动过程中碰撞造成的问题。这种缺乏优先排序是可以理解的:在模拟环境中,破坏虚拟物体没有内在成本。因此,训练有素的代理人经常与物体发生灾难性碰撞,尽管最终成功。在碰撞成本高的机器人界,避免碰撞是一个长期和关键的主题,以确保机器人能够安全地部署在现实世界。在这项工作中,我们迈出第一步,为视觉移动操作而实现无碰撞/潮暴动装饰的AI剂,便利在真实机器人中安全部署。我们开发了一种新的扰动避免方法,其核心是扰动预测的辅助任务。在与扰动惩罚相结合时,我们的辅助任务通过将干扰的知识蒸馏到代理人中,大大提高样本效率和最终性能。我们在ManipulthOR的实验中显示,在测试新物体的场景场上,我们的方法将成功率从61.7%提高到85.6%/比例,在50摄氏度上,而成功率则显示在原始基准点为50摄氏点。