Imitation learning (IL) algorithms have shown promising results for robots to learn skills from expert demonstrations. However, they need multi-task demonstrations to be provided at once for acquiring diverse skills, which is difficult in real world. In this work we study how to realize continual imitation learning ability that empowers robots to continually learn new tasks one by one, thus reducing the burden of multi-task IL and accelerating the process of new task learning at the same time. We propose a novel trajectory generation model that employs both a generative adversarial network and a dynamics-aware prediction model to generate pseudo trajectories from all learned tasks in the new task learning process. Our experiments on both simulation and real-world manipulation tasks demonstrate the effectiveness of our method.
翻译:模拟学习(IL)算法已经为机器人从专家演示中学习技能展现出有希望的结果,然而,它们需要同时提供多任务演示,以获得在现实世界中很困难的多种技能。在这项工作中,我们研究如何实现连续的模拟学习能力,使机器人能够不断逐项学习新任务,从而减轻多任务学习(IL)的负担,同时加快新任务学习进程。我们提出了一个新的轨迹生成模型,既使用基因对抗网络,又使用动态认知预测模型,从新任务学习过程中的所有学习任务中产生假轨迹。我们关于模拟和现实世界操作任务的实验显示了我们方法的有效性。