This paper presents ORB-SLAM3, the first system able to perform visual, visual-inertial and multi-map SLAM with monocular, stereo and RGB-D cameras, using pin-hole and fisheye lens models. The first main novelty is a feature-based tightly-integrated visual-inertial SLAM system that fully relies on Maximum-a-Posteriori (MAP) estimation, even during the IMU initialization phase. The result is a system that operates robustly in real-time, in small and large, indoor and outdoor environments, and is 2 to 5 times more accurate than previous approaches. The second main novelty is a multiple map system that relies on a new place recognition method with improved recall. Thanks to it, ORB-SLAM3 is able to survive to long periods of poor visual information: when it gets lost, it starts a new map that will be seamlessly merged with previous maps when revisiting mapped areas. Compared with visual odometry systems that only use information from the last few seconds, ORB-SLAM3 is the first system able to reuse in all the algorithm stages all previous information. This allows to include in bundle adjustment co-visible keyframes, that provide high parallax observations boosting accuracy, even if they are widely separated in time or if they come from a previous mapping session. Our experiments show that, in all sensor configurations, ORB-SLAM3 is as robust as the best systems available in the literature, and significantly more accurate. Notably, our stereo-inertial SLAM achieves an average accuracy of 3.6 cm on the EuRoC drone and 9 mm under quick hand-held motions in the room of TUM-VI dataset, a setting representative of AR/VR scenarios. For the benefit of the community we make public the source code.
翻译:本文展示了 ORB- SLAM3, 这是第一个能够使用单镜、立体和 RGB-D 相机, 使用针眼和鱼眼镜头模型, 执行视觉、 视觉、 视觉和多映像 SLAMM 系统的第一个系统。 第一个主要创新是基于地貌的、 紧密集成的视觉- 视觉- 线性 SLAM 系统, 完全依赖最大- a- Posteririi (MAP) 估计, 即使在IMU 初始化阶段, 也是第一个系统, 在小型、 大型、 室内和室外环境中, 并且比以往方法更精确。 第二个主要创新是多部地图系统, 依靠新的地点识别方法, 并改进了回顾。 感谢 ORB- SLAM 3 能够生存到长期的视觉信息: 当它丢失时, 将启动新的地图, 在重新绘制区域时, 它会与以前的地图进行无缝的合并。 与仅使用最后几秒钟的视觉- ORB- SARAM 3 系统相比, 它是第一个系统可以重新使用所有精确的精确的系统, 。 在所有算的精确的精确的逻辑观察阶段里程中, 当我们之前的直径级观察过程中, 显示所有的直径序的直径流的直线段的直线上的所有数据都显示, 提供了所有 KLM 。