Robots can effectively grasp and manipulate objects using their 3D models. In this paper, we propose a simple shape representation and a reconstruction method that outperforms state-of-the-art methods in terms of geometric metrics and enables grasp generation with high precision and success. Our reconstruction method models the object geometry as a pair of depth images, composing the "shell" of the object. This representation allows using image-to-image residual ConvNet architectures for 3D reconstruction, generates object reconstruction directly in the camera frame, and generalizes well to novel object types. Moreover, an object shell can be converted into an object mesh in a fraction of a second, providing time and memory efficient alternative to voxel or implicit representations. We explore the application of shell representation for grasp planning. With rigorous experimental validation, both in simulation and on a real setup, we show that shell reconstruction encapsulates sufficient geometric information to generate precise grasps and the associated grasp quality with over 90% accuracy. Diverse grasps computed on shell reconstructions allow the robot to select and execute grasps in cluttered scenes with more than 93% success rate.
翻译:机器人可以使用它们的 3D 模型有效地掌握和操控天体 。 在本文中, 我们提出一个简单的形状表示法和一种重建方法, 在几何测量度上优于最先进的方法, 并且能够以高度精确和成功的方式实现触摸生成。 我们的重建方法将天体几何模型作为一对深度图像来模型, 组成天体的“ 壳状” 。 这个表示法允许在 3D 重建中使用图像到图像的残余 ConNet 结构, 直接在摄像框架中生成对象重建, 并且将物体的重建概括到新的物体类型 。 此外, 一个天体外壳可以在第二小部分中转换成物体网格, 提供有效的时间和记忆替代 voxel 或隐含的表达方式。 我们探索将天体表示法应用来捕捉规划。 我们通过严格的实验验证, 在模拟和真实的设置中, 我们展示壳体再组合足够的几何测量信息, 来产生精确的捕捉取结果, 和相关的抓取质量超过 90% 。 在 罐壳体重建中计算得 。 在 罐壳体重建中计算得 计算得 使机器人能够以超过 超过 93%的成功率 。