Future planetary missions will rely on rovers that can autonomously explore and navigate in unstructured environments. An essential element is the ability to recognize places that were already visited or mapped. In this work, we leverage the ability of stereo cameras to provide both visual and depth information, guiding the search and validation of loop closures from a multi-modal perspective. We propose to augment submaps that are created by aggregating stereo point clouds, with visual keyframes. Point clouds matches are found by comparing CSHOT descriptors and validated by clustering, while visual matches are established by comparing keyframes using Bag-of-Words (BoW) and ORB descriptors. The relative transformations resulting from both keyframe and point cloud matches are then fused to provide pose constraints between submaps in our graph-based SLAM framework. Using the LRU rover, we performed several tests in both an indoor laboratory environment as well as a challenging planetary analog environment on Mount Etna, Italy. These environments consist of areas where either keyframes or point clouds alone failed to provide adequate matches demonstrating the benefit of the proposed multi-modal approach.
翻译:未来行星飞行任务将依赖能够自主探索和在非结构化环境中导航的流星体。 关键要素之一是能够识别已经访问或绘制过的地点。 在这项工作中,我们利用立体摄像机的能力提供视觉和深度信息,从多模式角度指导环绕封闭的搜索和验证。 我们提议增加通过集成立体点云和视觉键盘而创造的亚分布图。 通过比较CSHOT描述符和集群验证,可以发现点云匹配,而通过比较Wods(BoW)和ORB描述符而建立视觉匹配。 由关键框架和点云匹配而形成的相对变化随后被结合起来,为我们基于图形的 SLAM 框架中的子分布图层之间造成制约。 我们利用LRU rover在意大利埃特纳山的室内实验室环境以及具有挑战性的行星模拟环境进行了几次测试。 这些环境包括要么是关键框架,要么是点云光线光无法提供足够匹配的拟议多模式方法的好处的地区。