Excavation of irregular rigid objects in clutter, such as fragmented rocks and wood blocks, is very challenging due to their complex interaction dynamics and highly variable geometries. In this paper, we adopt reinforcement learning (RL) to tackle this challenge and learn policies to plan for a sequence of excavation trajectories for irregular rigid objects, given point clouds of excavation scenes. Moreover, we separately learn a compact representation of the point cloud on geometric tasks that do not require human labeling. We show that using the representation reduces training time for RL, while achieving similar asymptotic performance compare to an end-to-end RL algorithm. When using a policy trained in simulation directly on a real scene, we show that the policy trained with the representation outperforms end-to-end RL. To our best knowledge, this paper presents the first application of RL to plan a sequence of excavation trajectories of irregular rigid objects in clutter.
翻译:碎裂的岩石和木块等非常规硬性物体的挖掘由于复杂的互动动态和高度可变的几何特征而非常具有挑战性。在本文中,我们采用强化学习(RL)来应对这一挑战,并学习政策来规划对非常规硬性物体的一系列挖掘轨迹,给定了挖掘场景的点云。此外,我们分别从不要求人类标签的几何任务中了解到点云的缩略图。我们显示,使用该表达式减少了RL的训练时间,同时取得了与终端到终端的RL算法相似的零弹性能。当我们使用直接在真实场上进行模拟训练的政策时,我们展示了受培训的关于代表的策略优于终端到终端的RL的政策。据我们所知,本文首次应用RL来规划对非常规硬性物体的挖掘轨迹序列。