Multi-camera systems have been shown to improve the accuracy and robustness of SLAM estimates, yet state-of-the-art SLAM systems predominantly support monocular or stereo setups. This paper presents a generic sparse visual SLAM framework capable of running on any number of cameras and in any arrangement. Our SLAM system uses the generalized camera model, which allows us to represent an arbitrary multi-camera system as a single imaging device. Additionally, it takes advantage of the overlapping fields of view (FoV) by extracting cross-matched features across cameras in the rig. This limits the linear rise in the number of features with the number of cameras and keeps the computational load in check while enabling an accurate representation of the scene. We evaluate our method in terms of accuracy, robustness, and run time on indoor and outdoor datasets that include challenging real-world scenarios such as narrow corridors, featureless spaces, and dynamic objects. We show that our system can adapt to different camera configurations and allows real-time execution for typical robotic applications. Finally, we benchmark the impact of the critical design parameters - the number of cameras and the overlap between their FoV that define the camera configuration for SLAM. All our software and datasets are freely available for further research.
翻译:多相机系统已经显示,可以提高SLAM估计数的准确性和稳健性,然而,最先进的SLAM系统主要支持单视或立体设置,本文展示了一个通用的少见的SLAM框架,能够在任何摄像头上和在任何安排中运行。我们的SLAM系统使用通用的摄像模型,这使我们能够将一个任意的多相机系统作为单一成像装置来代表。此外,它利用了重叠的视野领域(FoV),在钻机中截取各相机的交叉比对功能。这限制了功能数量的直线增长,使其与照相机的数量保持同步,并保持计算负荷的检查,同时能够准确地显示场景。我们从准确性、稳健性的角度评价我们的方法,并在室内和室外数据集上运行时间,其中包括具有挑战性的地貌情景,例如窄走廊、无特色的空间和动态物体。我们显示,我们的系统可以适应不同的相机配置,并允许对典型的机器人应用进行实时执行。最后,我们衡量关键设计参数的影响,即相机的数目及其Fov软件之间的重叠,以便进一步定义我们现有的摄影机床。