The ability to learn from human demonstration endows robots with the ability to automate various tasks. However, directly learning from human demonstration is challenging since the structure of the human hand can be very different from the desired robot gripper. In this work, we show that manipulation skills can be transferred from a human to a robot through the use of micro-evolutionary reinforcement learning, where a five-finger human dexterous hand robot gradually evolves into a commercial robot, while repeated interacting in a physics simulator to continuously update the policy that is first learned from human demonstration. To deal with the high dimensions of robot parameters, we propose an algorithm for multi-dimensional evolution path searching that allows joint optimization of both the robot evolution path and the policy. Through experiments on human object manipulation datasets, we show that our framework can efficiently transfer the expert human agent policy trained from human demonstrations in diverse modalities to target commercial robots.
翻译:向人类演示的内脏机器人学习的能力能够使各种任务自动化。 但是,从人类演示中直接学习具有挑战性,因为人类的手结构可能与想要的机器人握手者大不相同。 在这项工作中,我们表明操纵技能可以通过利用微进化强化学习从人类向机器人转移,在这种学习中,五指人类的半手机器人逐渐演变为商业机器人,同时在物理学模拟器中反复互动,不断更新最初从人类演示中学习的政策。为了处理机器人参数的高维度,我们提出了多维进化路径的算法,以便共同优化机器人的进化路径和政策。通过对人类物体操纵数据集的实验,我们表明我们的框架可以有效地将从人类演示中以不同方式训练的人类代理人政策转让给商业机器人。