Robot manipulation in cluttered scenes often requires contact-rich interactions with objects. It can be more economical to interact via non-prehensile actions, for example, push through other objects to get to the desired grasp pose, instead of deliberate prehensile rearrangement of the scene. For each object in a scene, depending on its properties, the robot may or may not be allowed to make contact with, tilt, or topple it. To ensure that these constraints are satisfied during non-prehensile interactions, a planner can query a physics-based simulator to evaluate the complex multi-body interactions caused by robot actions. Unfortunately, it is infeasible to query the simulator for thousands of actions that need to be evaluated in a typical planning problem as each simulation is time-consuming. In this work, we show that (i) manipulation tasks (specifically pick-and-place style tasks from a tabletop or a refrigerator) can often be solved by restricting robot-object interactions to adaptive motion primitives in a plan, (ii) these actions can be incorporated as subgoals within a multi-heuristic search framework, and (iii) limiting interactions to these actions can help reduce the time spent querying the simulator during planning by up to 40x in comparison to baseline algorithms. Our algorithm is evaluated in simulation and in the real-world on a PR2 robot using PyBullet as our physics-based simulator. Supplementary video: \url{https://youtu.be/ABQc7JbeJPM}.
翻译:在杂乱的场景中机器人操作往往需要与对象进行接触丰富的互动。 通过非暴力行动进行互动比较经济。例如,通过其他对象推到想要的握住姿势,而不是故意的先发制人重新排列场景。对于一个场景中的每个对象,机器人可能或可能不允许接触、倾斜或压倒它。为了确保这些限制在非暴力互动中得到满足,计划设计者可以查询基于物理的模拟器来评估机器人行动造成的复杂的多机体相互作用。不幸的是,在每次模拟都耗时,在典型的规划问题中,无法就数千项行动向模拟器查询。在这项工作中,我们显示(i)操纵任务(具体地说从桌面或冰箱选取和定位风格任务)通常可以通过限制基于机器人的相互作用到计划中的适应性运动原始程序,(ii)这些动作可以作为子目标纳入一个多动动的模拟动作。 不幸的是,在模拟过程中,J的模拟模拟过程将用来评估的模型到我们的模拟模型中。 (iii) 将这些模拟模型的逻辑比力比力比力比力比力比力比力比力比力比力 。