Robot manipulation in cluttered scenes often requires contact-rich interactions with objects. It can be more economical to interact via non-prehensile actions, for example, push through other objects to get to the desired grasp pose, instead of deliberate prehensile rearrangement of the scene. For each object in a scene, depending on its properties, the robot may or may not be allowed to make contact with, tilt, or topple it. To ensure that these constraints are satisfied during non-prehensile interactions, a planner can query a physics-based simulator to evaluate the complex multi-body interactions caused by robot actions. Unfortunately, it is infeasible to query the simulator for thousands of actions that need to be evaluated in a typical planning problem as each simulation is time-consuming. In this work, we show that (i) manipulation tasks (specifically pick-and-place style tasks from a tabletop or a refrigerator) can often be solved by restricting robot-object interactions to adaptive motion primitives in a plan, (ii) these actions can be incorporated as subgoals within a multi-heuristic search framework, and (iii) limiting interactions to these actions can help reduce the time spent querying the simulator during planning by up to 40x in comparison to baseline algorithms. Our algorithm is evaluated in simulation and in the real-world on a PR2 robot using PyBullet as our physics-based simulator. Supplementary video: \url{https://youtu.be/ABQc7JbeJPM}.
翻译:在杂乱的场景中进行机器人操作通常需要与物体进行接触性互动。通过非抓握操作,例如将物品推动到所需的抓握姿势,而不是通过故意的抓握重排整个场景,可能更加经济。针对场景中的每个物体,根据其属性,机器人可能允许与其接触、倾斜或倾倒。为确保这些约束在非抓握交互期间得到满足,规划器可以查询基于物理学的模拟器以评估机器人动作引起的复杂多体互动。不幸的是,在一个典型的规划问题中需要评估成千上万个需要模拟的动作是不可行的,因为每次模拟都是耗时的。在这项工作中,我们展示了(i)操作任务(具体来说是从桌面或冰箱中拾取和放置物品的任务)通常可以通过将机器人-物体交互限制在计划中的自适应动作原语来解决,(ii)这些动作可以作为多启发式搜索框架中的子目标,(iii)限制交互到这些操作可以帮助减少规划期间查询模拟器的时间,与基准算法相比可降低40倍。我们的算法在PyBullet作为我们基于物理学的模拟器的仿真环境中进行评估,并且在PR2机器人上进行了真实世界测试。补充视频:\url {https://youtu.be/ABQc7JbeJPM}。