We propose a learning-based navigation system for reaching visually indicated goals and demonstrate this system on a real mobile robot platform. Learning provides an appealing alternative to conventional methods for robotic navigation: instead of reasoning about environments in terms of geometry and maps, learning can enable a robot to learn about navigational affordances, understand what types of obstacles are traversable (e.g., tall grass) or not (e.g., walls), and generalize over patterns in the environment. However, unlike conventional planning algorithms, it is harder to change the goal for a learned policy during deployment. We propose a method for learning to navigate towards a goal image of the desired destination. By combining a learned policy with a topological graph constructed out of previously observed data, our system can determine how to reach this visually indicated goal even in the presence of variable appearance and lighting. Three key insights, waypoint proposal, graph pruning and negative mining, enable our method to learn to navigate in real-world environments using only offline data, a setting where prior methods struggle. We instantiate our method on a real outdoor ground robot and show that our system, which we call ViNG, outperforms previously-proposed methods for goal-conditioned reinforcement learning, including other methods that incorporate reinforcement learning and search. We also study how \sysName generalizes to unseen environments and evaluate its ability to adapt to such an environment with growing experience. Finally, we demonstrate ViNG on a number of real-world applications, such as last-mile delivery and warehouse inspection. We encourage the reader to visit the project website for videos of our experiments and demonstrations sites.google.com/view/ving-robot.
翻译:我们提出一个基于学习的导航系统,以达到视觉显示的目标,并在真正的移动机器人平台上展示这个系统。学习为机器人导航提供了一种替代常规方法的诱人方法:学习不是从几何和地图的角度对环境进行推理,学习可以让机器人了解导航价格,了解哪些类型的障碍是可以穿行的(如高草)或不是(如高墙),并泛化环境模式。然而,与传统的规划算法不同,改变在部署期间学习政策的目标更为困难。我们提出了一种学习走向理想目的地目标图像的方法。我们将学习的政策与由先前观察到的数据构造的地形图相结合,我们系统就可以确定如何达到这一直观显示的目标,即使有变异的外观和光照,我们也可以理解哪些障碍(如高草草 ), 了解哪些类型的障碍(如高草坪 ), 以及哪些类型的障碍(如墙壁壁 ), 使我们的方法能够学习在现实世界环境中行走动, 仅使用离线数据, 也就是在以往方法上挣扎。我们对真正的户外机器人进行即我们的方法访问的方法, 并展示我们的系统, 显示我们的系统,我们叫VING+,我们称之为搜索,我们称之为搜索系统,我们称之为搜索,我们如何学习了它,, 学习如何加强环境, 学习了我们如何学习。