Intelligent robot grasping is a very challenging task due to its inherent complexity and non availability of sufficient labelled data. Since making suitable labelled data available for effective training for any deep learning based model including deep reinforcement learning is so crucial for successful grasp learning, in this paper we propose to solve the problem of generating grasping Poses/Rectangles using a Pix2Pix Generative Adversarial Network (Pix2Pix GAN), which takes an image of an object as input and produces the grasping rectangle tagged with the object as output. Here, we have proposed an end-to-end grasping rectangle generating methodology and embedding it to an appropriate place of an object to be grasped. We have developed two modules to obtain an optimal grasping rectangle. With the help of the first module, the pose (position and orientation) of the generated grasping rectangle is extracted from the output of Pix2Pix GAN, and then the extracted grasp pose is translated to the centroid of the object, since here we hypothesize that like the human way of grasping of regular shaped objects, the center of mass/centroids are the best places for stable grasping. For other irregular shaped objects, we allow the generated grasping rectangles as it is to be fed to the robot for grasp execution. The accuracy has significantly improved for generating the grasping rectangle with limited number of Cornell Grasping Dataset augmented by our proposed approach to the extent of 87.79%. Experiments show that our proposed generative model based approach gives the promising results in terms of executing successful grasps for seen as well as unseen objects.
翻译:智能机器人掌握是一个非常具有挑战性的任务, 因为它具有内在的复杂性, 且没有足够贴标签的数据 。 由于为任何深学习基于模型的有效培训提供合适的标签数据, 包括深强化学习对于成功掌握学习如此关键, 我们在本文件中建议解决使用 Pix2Pix Pix 基因反转网络( Pix2Pix Generation Aversarial Network (Pix2Pix Pix GAN) 生成抓取定位器的问题, Pix2Pix GAN) 将一个对象的图像作为输入, 并生成与该对象输出相连接的抓取矩形格。 在此, 我们提议了一个端到端到端的抓取矩形矩形, 将其嵌入一个合适的对象位置。 我们已经开发了两个模块, 以获得最佳的抓角矩形矩形。 在第一个模块的帮助下, 将生成的矩形( 位置和方向) 将所生成的矩形( 位置) 转换成一个更亮的立形图形法, 因为在这里, 我们用人类的方法来捕捉取固定的矩形的矩形的矩形矩形的矩形的矩形方法 。 将固定的矩形显示的矩形的矩形的矩形, 将显示为稳定的矩形的矩形的矩形的缩成为稳定的矩形, 的正形的缩的正形的正形的正形的缩成成成成成成成的矩形, 。