Robotic navigation has been approached as a problem of 3D reconstruction and planning, as well as an end-to-end learning problem. However, long-range navigation requires both planning and reasoning about local traversability, as well as being able to utilize general knowledge about global geography, in the form of a roadmap, GPS, or other side information providing important cues. In this work, we propose an approach that integrates learning and planning, and can utilize side information such as schematic roadmaps, satellite maps and GPS coordinates as a planning heuristic, without relying on them being accurate. Our method, ViKiNG, incorporates a local traversability model, which looks at the robot's current camera observation and a potential subgoal to infer how easily that subgoal can be reached, as well as a heuristic model, which looks at overhead maps for hints and attempts to evaluate the appropriateness of these subgoals in order to reach the goal. These models are used by a heuristic planner to identify the best waypoint in order to reach the final destination. Our method performs no explicit geometric reconstruction, utilizing only a topological representation of the environment. Despite having never seen trajectories longer than 80 meters in its training dataset, ViKiNG can leverage its image-based learned controller and goal-directed heuristic to navigate to goals up to 3 kilometers away in previously unseen environments, and exhibit complex behaviors such as probing potential paths and backtracking when they are found to be non-viable. ViKiNG is also robust to unreliable maps and GPS, since the low-level controller ultimately makes decisions based on egocentric image observations, using maps only as planning heuristics. For videos of our experiments, please check out our project page https://sites.google.com/view/viking-release.
翻译:机械导航被视为3D的重建和规划问题,也被视为一个端到端学习问题。然而,远程导航既需要规划和推理本地的可穿行性,也需要能够以路线图、GPS或其他提供重要提示的侧面信息的形式,利用全球地理的一般知识。在这项工作中,我们提出一种方法,将学习和规划结合起来,并可以将侧面信息,如示意式路线图、卫星地图和全球定位系统坐标作为规划的超常性,而不必依赖其准确性。我们的方法,ViKiNG, 包含了一个本地可穿行性模型,该模型既要观察机器人当前的摄像头观察,又要考察本地可穿行性可穿行性可穿行性可穿行性可穿行性,而且可能子色化模型也只能用来预测亚历的亚历,尽管我们从未看过高端的GOFO/Clorvices, 也只能用这些模型来确定最稳健的后向最终目的地的路径。我们的方法,我们的方法只能进行明确的几何级重建,而只能使用更深层次的直径直径直达的轨道,在80个方向的轨道上,直到他所了解的轨道上,而他从他所了解的轨道的图像的图像的图像中,在学习的图像上,要到他所学的图像的图像上。