Autonomous grasping is challenging due to the high computational cost caused by multi-fingered robotic hands and their interactions with objects. Various analytical methods have been developed yet their high computational cost limits the adoption in real-world applications. Learning-based grasping can afford real-time motion planning thanks to its high computational efficiency. However, it needs to explore large search spaces during its learning process. The search space causes low learning efficiency, which has been the main barrier to its practical adoption. In this work, we develop a novel Physics-Guided Deep Reinforcement Learning with a Hierarchical Reward Mechanism, which combines the benefits of both analytical methods and learning-based methods for autonomous grasping. Different from conventional observation-based grasp learning, physics-informed metrics are utilized to convey correlations between features associated with hand structures and objects to improve learning efficiency and learning outcomes. Further, a hierarchical reward mechanism is developed to enable the robot to learn the grasping task in a prioritized way. It is validated in a grasping task with a MICO robot arm in simulation and physical experiments. The results show that our method outperformed the baseline in task performance by 48% and learning efficiency by 40%.
翻译:自主掌握具有挑战性,因为多指机器人手及其与物体的相互作用造成了高昂的计算成本。已经开发了各种分析方法,但它们的高计算成本限制了实际应用的采用。基于学习的掌握能够支付实时运动规划费用,因为其计算效率很高。然而,它需要在其学习过程中探索巨大的搜索空间。搜索空间导致学习效率低,这是其实际应用的主要障碍。在这项工作中,我们开发了一个新型的物理指导深强化学习,并有一个等级机制,将分析方法和学习方法的效益结合起来,以便自主掌握。与传统的观察掌握学习不同,物理知情的计量标准被用来传递与手结构及物体相关功能之间的相互关系,以提高学习效率和学习成果。此外,还开发了一个等级奖励机制,使机器人能够以优先的方式学习掌握任务。在模拟和物理实验中,我们与MICO机器人的机械臂一起掌握的任务得到了验证。结果显示,我们的方法比任务执行基准高出了48%,通过学习效率超过40%。