Visual Place Recognition (VPR) is often characterized as being able to recognize the same place despite significant changes in appearance and viewpoint. VPR is a key component of Spatial Artificial Intelligence, enabling robotic platforms and intelligent augmentation platforms such as augmented reality devices to perceive and understand the physical world. In this paper, we observe that there are three "drivers" that impose requirements on spatially intelligent agents and thus VPR systems: 1) the particular agent including its sensors and computational resources, 2) the operating environment of this agent, and 3) the specific task that the artificial agent carries out. In this paper, we characterize and survey key works in the VPR area considering those drivers, including their place representation and place matching choices. We also provide a new definition of VPR based on the visual overlap -- akin to spatial view cells in the brain -- that enables us to find similarities and differences to other research areas in the robotics and computer vision fields. We identify numerous open challenges and suggest areas that require more in-depth attention in future works.
翻译:视觉定位(VPR)通常被描述为能够识别同一地点,尽管外观和观点发生了重大变化。 VPR是空间人工智能的关键组成部分,它使机器人平台和智能增强平台成为能够感知和理解物理世界的强化现实装置等智能增强平台。在本文中,我们观察到,有三个“驱动器”对空间智能剂和VPR系统提出了要求:1)特定代理器,包括其传感器和计算资源,2)该代理器的操作环境,3)该人工代理器的具体任务。在本文中,我们描述和调查VPR领域的主要工作,考虑这些驱动器,包括其位置和位置匹配选择。我们还根据视觉重叠(类似于大脑空间观察细胞)对VPR的新定义,使我们能够找到机器人和计算机视觉领域其他研究领域的相似之处和差异。我们确定了许多公开的挑战,并提出在未来工作中需要更深入关注的领域。