We study a modular approach to tackle long-horizon mobile manipulation tasks for object rearrangement, which decomposes a full task into a sequence of subtasks. To tackle the entire task, prior work chains multiple stationary manipulation skills with a point-goal navigation skill, which are learned individually on subtasks. Although more effective than monolithic end-to-end RL policies, this framework suffers from compounding errors in skill chaining, e.g., navigating to a bad location where a stationary manipulation skill can not reach its target to manipulate. To this end, we propose that the manipulation skills should include mobility to have flexibility in interacting with the target object from multiple locations and at the same time the navigation skill could have multiple end points which lead to successful manipulation. We operationalize these ideas by implementing mobile manipulation skills rather than stationary ones and training a navigation skill trained with region goal instead of point goal. We evaluate our multi-skill mobile manipulation method M3 on 3 challenging long-horizon mobile manipulation tasks in the Home Assistant Benchmark (HAB), and show superior performance as compared to the baselines.
翻译:我们研究一种模块化方法,以解决用于天体重新定位的长视距移动操纵任务,该模块化方法将一个全部任务分解成一个子任务序列。为了完成整个任务,先前的工作链中包含多个固定操纵技能,带有点目标导航技能,这些技能在子任务上单独学习。虽然比单立端对端RL政策更有效,但这一框架在技能链化方面有多重错误,例如,将固定操作技能拖到一个无法达到其操作目标的坏地点。为此,我们建议操作技能应包括机动性,以便与目标目标物体从多个地点进行互动时具有灵活性,与此同时,导航技能可以具有多个终点,从而导致成功操作。我们通过采用移动操作技能而不是固定式导航技能,以及培训以区域目标而不是点目标为培训的导航技能,来落实这些想法。我们评估我们的多技能移动操纵方法M3,用于挑战主干助理基准(HAB)的长视距移动操纵任务,并显示与基线相比的优劣性。