Collision-free, goal-directed navigation in environments containing unknown static and dynamic obstacles is still a great challenge, especially when manual tuning of navigation policies or costly motion prediction needs to be avoided. In this paper, we therefore propose a subgoal-driven hierarchical navigation architecture that is trained with deep reinforcement learning and decouples obstacle avoidance and motor control. In particular, we separate the navigation task into the prediction of the next subgoal position for avoiding collisions while moving toward the final target position, and the prediction of the robot's velocity controls. By relying on 2D lidar, our method learns to avoid obstacles while still achieving goal-directed behavior as well as to generate low-level velocity control commands to reach the subgoals. In our architecture, we apply the attention mechanism on the robot's 2D lidar readings and compute the importance of lidar scan segments for avoiding collisions. As we show in simulated and real-world experiments with a Turtlebot robot, our proposed method leads to smooth and safe trajectories among humans and significantly outperforms a state-of-the-art approach in terms of success rate. A supplemental video describing our approach is available online.
翻译:在含有未知静态和动态障碍的环境中,在有未知静态和动态障碍的环境中,目标定向导航仍是一项巨大挑战,特别是在需要避免对导航政策进行手工调整或进行昂贵的动作预测时,我们因此在本文件中提议一个次级目标驱动的等级导航结构,经过深加学习和分解障碍避免和运动控制的培训。特别是,我们将导航任务分为下一个次级目标位置的预测,以避免碰撞,同时向最后目标位置移动,以及预测机器人的速度控制。我们的方法依靠2D Lidar,学会在仍然实现目标导向行为的同时避免障碍,并生成达到次级目标的低速度控制命令。在我们的结构中,我们把注意力机制应用于机器人的2D Lidar读数,并理解Lidar扫描段对于避免碰撞的重要性。我们在与Turtobot机器人的模拟和现实世界实验中显示,我们提议的方法导致人类之间平稳和安全的轨迹,并大大超越了在线成功率的状态方法。</s>