Learning generalizable insertion skills in a data-efficient manner has long been a challenge in the robot learning community. While the current state-of-the-art methods with reinforcement learning (RL) show promising performance in acquiring manipulation skills, the algorithms are data-hungry and hard to generalize. To overcome the issues, in this paper we present Prim-LAfD, a simple yet effective framework to learn and adapt primitive-based insertion skills from demonstrations. Prim-LAfD utilizes black-box function optimization to learn and adapt the primitive parameters leveraging prior experiences. Human demonstrations are modeled as dense rewards guiding parameter learning. We validate the effectiveness of the proposed method on eight peg-hole and connector-socket insertion tasks. The experimental results show that our proposed framework takes less than one hour to acquire the insertion skills and as few as fifteen minutes to adapt to an unseen insertion task on a physical robot.
翻译:长期以来,以数据效率方式学习通用插入技能一直是机器人学习界的一个挑战。虽然目前最先进的强化学习方法(RL)在获得操作技能方面表现良好,但算法是数据渴望和难以概括的。为了克服这些问题,我们在本文件中介绍Prim-LAfD,这是一个简单而有效的框架,从演示中学习和改造原始的插入技能。Prim-LAfD利用黑盒功能优化来学习和调整原始参数,利用以往的经验。人类演示以密集的奖赏指导参数学习为模型。我们验证了8个隐形洞和连接器插入任务的拟议方法的有效性。实验结果显示,我们提议的框架需要不到1小时的时间获得插入技能,只有不到15分钟的时间来适应物理机器人的无形插入任务。