Picking cluttered general objects is a challenging task due to the complex geometries and various stacking configurations. Many prior works utilize pose estimation for picking, but pose estimation is difficult on cluttered objects. In this paper, we propose Cluttered Objects Descriptors (CODs), a dense cluttered objects descriptor that can represent rich object structures, and use the pre-trained CODs network along with its intermediate outputs to train a picking policy. Additionally, we train the policy with reinforcement learning, which enable the policy to learn picking without supervision. We conduct experiments to demonstrate that our CODs is able to consistently represent seen and unseen cluttered objects, which allowed for the picking policy to robustly pick cluttered general objects. The resulting policy can pick 96.69% of unseen objects in our experimental environment which is twice as cluttered as the training scenarios.
翻译:拾取混乱的一般物体是一项具有挑战性的任务,由于其复杂的几何形状和各种堆叠配置。许多先前的作品使用姿态估计进行拾取,但在混乱的物体上进行姿态估计是困难的。在本文中,我们提出了 Cluttered Objects Descriptors(CODs)——一种密集的混乱物体描述符,可以代表丰富的物体结构,使用预先训练的 CODs 网络及其中间输出训练拾取策略。此外,我们使用强化学习训练策略,使策略能够在没有监督的情况下学习拾取。我们进行了实验,证明了我们的 CODs 能够一致地表示已知和未知的混乱物体,使拾取策略能够稳健地拾取混乱的一般物体。所得到的策略可以在实验环境中拾取96.69%的未知对象,该环境是训练场景的两倍混乱。