Autonomous robots would benefit a lot by gaining the ability to manipulate their environment to solve path planning tasks, known as the Navigation Among Movable Obstacle (NAMO) problem. In this paper, we present a deep reinforcement learning approach for solving NAMO locally, near narrow passages. We train parallel agents in physics simulation using an Advantage Actor-Critic based algorithm with a multi-modal neural network. We present an online policy that is able to push obstacles in a non-axial-aligned fashion, react to unexpected obstacle dynamics in real-time, and solve the local NAMO problem. Experimental validation in simulation shows that the presented approach generalises to unseen NAMO problems in unknown environments. We further demonstrate the implementation of the policy on a real quadrupedal robot, showing that the policy can deal with real-world sensor noises and uncertainties in unseen NAMO tasks.
翻译:自主机器人将获益匪浅,因为他们有能力操控环境,解决路径规划任务,即“可移动障碍导航”问题。在本文中,我们展示了一种深入强化的学习方法,以解决本地、近窄通道的NAMO问题。我们用一种基于优势的Actor-Critic 的算法,利用多式神经网络,在物理模拟中培训平行物剂。我们展示了一种在线政策,它能够以非轴式方式推动障碍,实时对意外障碍动态作出反应,并解决本地的NAMO问题。模拟中的实验性验证表明,所提出的方法概括了在未知环境中看不见的NATO问题。我们进一步展示了对真正四重立的机器人的政策的执行情况,表明该政策可以应对现实世界传感器的噪音和未知的NATO任务中的不确定性。</s>