Ground texture based localization methods are potential prospects for low-cost, high-accuracy self-localization solutions for robots. These methods estimate the pose of a given query image, i.e. the current observation of the ground from a downward-facing camera, in respect to a set of reference images whose poses are known in the application area. In this work, we deal with the initial localization task, in which we have no prior knowledge about the current robot positioning. In this situation, the localization method would have to consider all available reference images. However, in order to reduce computational effort and the risk of receiving a wrong result, we would like to consider only those reference images that are actually overlapping with the query image. For this purpose, we propose a deep metric learning approach that retrieves the most similar reference images to the query image. In contrast to existing approaches to image retrieval for ground images, our approach achieves significantly better recall performance and improves the localization performance of a state-of-the-art ground texture based localization method.
翻译:基于地面纹理的本地化方法是机器人低成本、高精确度自我本地化解决方案的潜在前景。 这些方法估计了特定查询图像的外观, 即从向下摄像头对地面的当前观测, 其外观在应用区已知的一组参考图像。 在这项工作中, 我们处理初始本地化任务, 即我们事先对当前机器人定位没有了解。 在这种情况下, 本地化方法必须考虑所有可用的本地化图像。 但是, 为了减少计算努力和接收错误结果的风险, 我们希望只考虑那些实际上与查询图像重叠的参考图像。 为此, 我们提出一个深度的多指标学习方法, 以检索与查询图像最相似的参考图像。 与现有的地面图像检索方法相比, 我们的方法可以大大改进基于状态地面纹理的本地化方法的绩效并改进本地化绩效。