Non-prehensile multi-object rearrangement is a robotic task of planning feasible paths and transferring multiple objects to their predefined target poses without grasping. It needs to consider how each object reaches the target and the order of object movement, which significantly deepens the complexity of the problem. To address these challenges, we propose a hierarchical policy to divide and conquer for non-prehensile multi-object rearrangement. In the high-level policy, guided by a designed policy network, the Monte Carlo Tree Search efficiently searches for the optimal rearrangement sequence among multiple objects, which benefits from imitation and reinforcement. In the low-level policy, the robot plans the paths according to the order of path primitives and manipulates the objects to approach the goal poses one by one. We verify through experiments that the proposed method can achieve a higher success rate, fewer steps, and shorter path length compared with the state-of-the-art.
翻译:非致命性多物体重新排列是一项机器人任务,即规划可行的路径,将多个物体转移到预先确定的目标位置,而无需掌握。它需要考虑每个物体如何达到目标以及物体移动的顺序,这大大加深了问题的复杂性。为了应对这些挑战,我们建议采取分级政策,为非致命性多物体重新排列进行划分和征服。在高层次政策中,在设计的政策网络蒙特卡洛树搜索的指导下,有效搜索多个物体的最佳重新排列顺序,这得益于模仿和强化。在低层次政策中,机器人根据原始路径的顺序规划路径,并逐个地操纵接近目标的物体。我们通过实验核实,拟议的方法能够取得更高的成功率、更少的步骤和较短的路径长度。