Where am I? This is one of the most critical questions that any intelligent system should answer to decide whether it navigates to a previously visited area. This problem has long been acknowledged for its challenging nature in simultaneous localization and mapping (SLAM), wherein the robot needs to correctly associate the incoming sensory data to the database allowing consistent map generation. The significant advances in computer vision achieved over the last 20 years, the increased computational power, and the growing demand for long-term exploration contributed to efficiently performing such a complex task with inexpensive perception sensors. In this article, visual loop closure detection, which formulates a solution based solely on appearance input data, is surveyed. We start by briefly introducing place recognition and SLAM concepts in robotics. Then, we describe a loop closure detection system's structure, covering an extensive collection of topics, including the feature extraction, the environment representation, the decision-making step, and the evaluation process. We conclude by discussing open and new research challenges, particularly concerning the robustness in dynamic environments, the computational complexity, and scalability in long-term operations. The article aims to serve as a tutorial and a position paper for newcomers to visual loop closure detection.
翻译:我在哪里?这是任何智能系统应该解答的最关键问题之一,以决定它是否指向以前访问过的地区。这个问题长期以来一直被确认为在同时定位和绘图(SLAM)中具有挑战性,机器人需要正确地将收到的感官数据与数据库联系起来,以便能够以一致的方式绘制地图。过去20年来在计算机视野方面取得的重大进步,计算能力的增长,以及对长期勘探的日益增长的需求,都有助于以廉价的感知传感器高效率地完成如此复杂的任务。在文章中,只根据外观输入数据制定解决方案的视觉环闭探测被调查被调查。我们首先在机器人中简短地介绍地点识别和SLAM概念。然后,我们描述环闭探测系统的结构,包括广泛收集专题,包括特征提取、环境代表、决策步骤和评价过程。我们最后通过讨论开放和新的研究挑战,特别是动态环境中的稳健性、计算复杂性和长期操作的可缩缩略性。文章的目的是作为新者观察视觉环路探测的缩略图和立场文件。