6D pose recognition has been a crucial factor in the success of robotic grasping, and recent deep learning based approaches have achieved remarkable results on benchmarks. However, their generalization capabilities in real-world applications remain unclear. To overcome this gap, we introduce 6IMPOSE, a novel framework for sim-to-real data generation and 6D pose estimation. 6IMPOSE consists of four modules: First, a data generation pipeline that employs the 3D software suite Blender to create synthetic RGBD image datasets with 6D pose annotations. Second, an annotated RGBD dataset of five household objects generated using the proposed pipeline. Third, a real-time two-stage 6D pose estimation approach that integrates the object detector YOLO-V4 and a streamlined, real-time version of the 6D pose estimation algorithm PVN3D optimized for time-sensitive robotics applications. Fourth, a codebase designed to facilitate the integration of the vision system into a robotic grasping experiment. Our approach demonstrates the efficient generation of large amounts of photo-realistic RGBD images and the successful transfer of the trained inference model to robotic grasping experiments, achieving an overall success rate of 87% in grasping five different household objects from cluttered backgrounds under varying lighting conditions. This is made possible by the fine-tuning of data generation and domain randomization techniques, and the optimization of the inference pipeline, overcoming the generalization and performance shortcomings of the original PVN3D algorithm. Finally, we make the code, synthetic dataset, and all the pretrained models available on Github.
翻译:6D 表面上的承认是机器人成功捕捉的一个关键因素,而最近的深层次基于学习的方法在基准方面取得了显著的成果。然而,它们在现实世界应用中的普及能力仍然不明确。为了克服这一差距,我们引入了6IMPOSE,一个用于模拟到真实数据生成和6D 的估算的新框架。 6IMPOSE由四个模块组成:第一,一个数据生成管道,使用3D软件套装件Blender来创建合成的 RGBD 图像数据集,并配有6D 构成说明。第二,一个附加说明的 RGBD 数据集,由5个使用拟议管道生成的家用物体组成。第三,一个实时的二级6D 显示估算方法,将天体探测器YOLO-V4和6D 的简化的实时估算算法用于具有时间敏感性的机器人生成应用程序。 第四,一个代码库,旨在便利将视觉系统整合成一个机器人捕捉实验。 我们的方法展示了大量摄影现实RGBD图像的高效生成,并且将经过培训的当前版版版版版本数据转换成一个跨版的版本。</s>