In mobile manipulation (MM), robots can both navigate within and interact with their environment, and can thus complete many more tasks than robots capable of only navigation or only manipulation. In this work, we explore how to apply imitation learning (IL) to learn continuous visuo-motor policies for MM tasks. Much prior work has shown that IL can train visuo-motor policies for either the manipulation or the navigation domain, but few works have applied IL to the MM domain. Doing so is challenging for two reasons: on the data side, current interfaces make collecting high-quality human demonstrations difficult, and on the learning side, policies trained on limited data can suffer from covariate shift when deployed. To address these problems, we first propose Mobile Manipulation RoboTurk (MoMaRT), a novel teleoperation framework allowing simultaneous navigation and manipulation of mobile manipulators, and collect a first-of-its-kind large-scale dataset in a realistic simulated kitchen setting. We then propose a learned error detection system that addresses covariate shift by detecting when an agent is in a potential failure state. We train performant IL policies and error detectors from this data, and achieve over 45% task success rate and 85% error detection success rate across multiple multi-stage tasks when trained on expert data. Codebase, datasets, visualization, and more are available at https://sites.google.com/view/il-for-mm/home.