This paper presents a self-supervised Learning from Learned Hallucination (LfLH) method to learn fast and reactive motion planners for ground and aerial robots to navigate through highly constrained environments. The recent Learning from Hallucination (LfH) paradigm for autonomous navigation executes motion plans by random exploration in completely safe obstacle-free spaces, uses hand-crafted hallucination techniques to add imaginary obstacles to the robot's perception, and then learns motion planners to navigate in realistic, highly-constrained, dangerous spaces. However, current hand-crafted hallucination techniques need to be tailored for specific robot types (e.g., a differential drive ground vehicle), and use approximations heavily dependent on certain assumptions (e.g., a short planning horizon). In this work, instead of manually designing hallucination functions, LfLH learns to hallucinate obstacle configurations, where the motion plans from random exploration in open space are optimal, in a self-supervised manner. LfLH is robust to different robot types and does not make assumptions about the planning horizon. Evaluated in both simulated and physical environments with a ground and an aerial robot, LfLH outperforms or performs comparably to previous hallucination approaches, along with sampling- and optimization-based classical methods.
翻译:本文介绍了一种自我监督的 " 学习幻觉 " 方法,用于学习地面和空中机器人快速和被动的运动规划者,以在高度受限的环境中航行。最近的自主导航的 " 幻觉 " 模式(LfH)模式(LfH)通过在完全安全的无障碍空间随机探索执行运动计划,使用手工制作的幻觉技术来增加机器人感知的假想障碍,然后学习运动规划者如何在现实、高度受限制和危险的空间航行。然而,目前手工制作的幻觉技术需要针对特定的机器人类型(例如,不同的地面驱动器)进行设计,并使用高度依赖某些假设的近似值(例如,短规划地平线)。在这一工作中,LfLH不是手工设计幻觉功能,而是学习幻觉障碍配置,在开放空间进行随机探索的移动计划是最佳的,是自我超强的。LfLHH对不同的机器人类型是强大的,并且不根据规划地平面的假设。在模拟和物理环境中,用地面和空中的模拟和空中的升级方法对模拟环境进行了评估。