Vision-based sensors have shown significant performance, accuracy, and efficiency gain in Simultaneous Localization and Mapping (SLAM) systems in recent years. In this regard, Visual Simultaneous Localization and Mapping (VSLAM) methods refer to the SLAM approaches that employ cameras for pose estimation and map generation. We can see many research works that demonstrated VSLAMs can outperform traditional methods, which rely only on a particular sensor, such as a Lidar, even with lower costs. VSLAM approaches utilize different camera types (e.g., monocular, stereo, and RGB-D), have been tested on various datasets (e.g., KITTI, TUM RGB-D, and EuRoC) and in dissimilar environments (e.g., indoors and outdoors), and employ multiple algorithms and methodologies to have a better understanding of the environment. The mentioned variations have made this topic popular for researchers and resulted in a wide range of VSLAMs methodologies. In this regard, the primary intent of this survey is to present the recent advances in VSLAM systems, along with discussing the existing challenges and trends. We have given an in-depth literature survey of forty-five impactful papers published in the domain of VSLAMs. We have classified these manuscripts by different characteristics, including the novelty domain, objectives, employed algorithms, and semantic level. We also discuss the current trends and future directions that may help researchers investigate them.
翻译:视觉感测器近年来在同步本地化和绘图系统(SLAM)中表现出了显著的性能、准确性和效率,在这方面,视觉同步本地化和绘图方法(VSLAM)是指SLM方法,采用照相机进行估测和绘制地图。我们可以看到许多显示VSLAMs的研究工作能够超越传统方法,这些方法只依赖于特定的传感器,如Lidar,甚至成本较低。VSLAM方法使用不同的照相机类型(如单镜、立体和RGB-D),在各种数据集(如KITTI、TUM RGB-D和EuRoC)和不同环境中(如室内和室外)进行了测试,并采用多种算法和方法来更好地了解环境。上述变异使这个专题对研究人员很受欢迎,并产生了广泛的VSLAM方法。在这方面,本调查的主要目的是介绍VSLAM系统的最新进展,包括KITTI、T-D、TUD和EuRoC)以及不同环境(如室内和室)的当前趋势,我们从深度研究中了解了目前的趋势和历史结构图。