Existing spatial localization techniques for autonomous vehicles mostly use a pre-built 3D-HD map, often constructed using a survey-grade 3D mapping vehicle, which is not only expensive but also laborious. This paper shows that by using an off-the-shelf high-definition satellite image as a ready-to-use map, we are able to achieve cross-view vehicle localization up to a satisfactory accuracy, providing a cheaper and more practical way for localization. Although the idea of using satellite images for cross-view localization is not new, previous methods almost exclusively treat the task as image retrieval, namely matching a vehicle-captured ground-view image with the satellite image. This paper presents a novel cross-view localization method, which departs from the common wisdom of image retrieval. Specifically, our method develops (1) a Geometric-align Feature Extractor (GaFE) that leverages measured 3D points to bridge the geometric gap between ground view and overhead view, (2) a Pose Aware Branch (PAB) adopting a triplet loss to encourage pose-aware feature extracting, and (3) a Recursive Pose Refine Branch (RPRB) using the Levenberg-Marquardt (LM) algorithm to align the initial pose towards the true vehicle pose iteratively. Our method is validated on KITTI and Ford Multi-AV Seasonal datasets as ground view and Google Maps as the satellite view. The results demonstrate the superiority of our method in cross-view localization with spatial and angular errors within 1 meter and $2^\circ$, respectively. The code will be made publicly available.
翻译:自主车辆的现有空间本地化技术大多使用预先建造的 3D-HD 地图,通常使用测量级 3D 地面图像,不仅昂贵,而且费解。本文显示,通过使用现成的高清晰卫星图像作为现成使用地图,我们能够实现交叉视图车辆本地化,达到令人满意的准确度,为定位提供更便宜和更实用的方法。虽然使用卫星图像进行交叉浏览本地化的想法并不新鲜,但以往的方法几乎完全将任务作为图像检索,即将车辆捕获的地面图像与卫星图像相匹配。本文展示了一种创新的交叉视图本地化方法,这与图像检索的共同智慧不同。具体地说,我们的方法开发了(1) 测量了3D点,以缩小地面视图和间接视图之间的几何差距,(2) 了解Pose(PaB) 采用了三重成本成本成本法,以鼓励图像采集地貌特征,以及(3) 重新精确的地面定位(Mare-Reval-Rest) 将实时平流路段的图像和图像定位方法分别用于实时地平流路段的平流法。