终身地表人文视觉导航 (Lifelong Topological Visual Navigation)

The ability for a robot to navigate with only the use of vision is appealing due to its simplicity. Traditional vision-based navigation approaches required a prior map-building step that was arduous and prone to failure, or could only exactly follow previously executed trajectories. Newer learning-based visual navigation techniques reduce the reliance on a map and instead directly learn policies from image inputs for navigation. There are currently two prevalent paradigms: end-to-end approaches forego the explicit map representation entirely, and topological approaches which still preserve some loose connectivity of the space. However, while end-to-end methods tend to struggle in long-distance navigation tasks, topological map-based solutions are prone to failure due to spurious edges in the graph. In this work, we propose a learning-based topological visual navigation method with graph update strategies that improve lifelong navigation performance over time. We take inspiration from sampling-based planning algorithms to build image-based topological graphs, resulting in sparser graphs yet with higher navigation performance compared to baseline methods. Also, unlike controllers that learn from fixed training environments, we show that our model can be finetuned using a relatively small dataset from the real-world environment where the robot is deployed. We further assess performance of our system in real-world deployments.

翻译：仅使用视野的机器人导航的能力因其简单性而具有吸引力。传统的基于视觉的导航方法需要事先的地图建设步骤,该步骤是艰苦的,容易失败,或者只能完全遵循先前执行的轨迹。新的基于学习的视觉导航技术减少了对地图的依赖,而是直接从导航的图像输入中学习政策。目前有两个流行的范式:端到端方法完全放弃清晰的地图代表,以及仍然保持空间某些松散连接的地形学方法。然而,虽然端到端方法倾向于在远程导航任务中挣扎,但基于地形的地图解决方案容易因图中的虚幻边缘而失败。在这项工作中,我们建议一种基于学习的基于地形的视觉导航方法,其图形更新战略将随着时间的推移提高终生导航的性能。我们从基于取样的规划算法中获得灵感,以建立基于图像的表层图,结果是较弱的导航性能比基线方法还要高。此外,与从固定的训练环境中学习的控制者不同,我们显示我们的模型可以使用相对小的性能在现实世界中进行我们所部署的系统。