Visual localization techniques often comprise a hierarchical localization pipeline, with a visual place recognition module used as a coarse localizer to initialize a pose refinement stage. While improving the pose refinement step has been the focus of much recent research, most work on the coarse localization stage has focused on improvements like increased invariance to appearance change, without improving what can be loose error tolerances. In this letter, we propose two methods which adapt image retrieval techniques used for visual place recognition to the Bayesian state estimation formulation for localization. We demonstrate significant improvements to the localization accuracy of the coarse localization stage using our methods, whilst retaining state-of-the-art performance under severe appearance change. Using extensive experimentation on the Oxford RobotCar dataset, results show that our approach outperforms comparable state-of-the-art methods in terms of precision-recall performance for localizing image sequences. In addition, our proposed methods provides the flexibility to contextually scale localization latency in order to achieve these improvements. The improved initial localization estimate opens up the possibility of both improved overall localization performance and modified pose refinement techniques that leverage this improved spatial prior.
翻译:视觉本地化技术通常包括一个等级级本地化管道,其视觉位置识别模块用作粗略的本地化器,以启动一个组合化阶段。改进组合化步骤是最近许多研究的重点,而改进组合化步骤是大部分研究的重点,但粗略本地化阶段的大部分工作侧重于改进,如增加对外观变化的偏差,而没有改进松散的错误容忍度。在此信中,我们建议了两种方法,将用于视觉定位的图像检索技术与巴伊西亚州本地化估算公式相适应。我们展示了使用我们的方法大大改进粗略本地化阶段的本地化准确性,同时在严重外观变化的情况下保留了最先进的性能。在牛津机器人汽车数据集上进行的广泛实验表明,我们的方法在本地化图像序列的精确-回调性能方面优于可比的先进方法。此外,我们提出的方法为根据具体情况缩放本地化拉特度以实现这些改进提供了灵活性。改进的初始本地化估计使得总体本地化绩效得到改进,并改进了组合化技术,从而利用了这一改进后的空间先前的改进。