LiDAR-based place recognition is one of the key components of SLAM and global localization in autonomous vehicles and robotics applications. With the success of DL approaches in learning useful information from 3D LiDARs, place recognition has also benefited from this modality, which has led to higher re-localization and loop-closure detection performance, particularly, in environments with significant changing conditions. Despite the progress in this field, the extraction of proper and efficient descriptors from 3D LiDAR data that are invariant to changing conditions and orientation is still an unsolved challenge. To address this problem, this work proposes a novel 3D LiDAR-based deep learning network (named AttDLNet) that uses a range-based proxy representation for point clouds and an attention network with stacked attention layers to selectively focus on long-range context and inter-feature relationships. The proposed network is trained and validated on the KITTI dataset and an ablation study is presented to assess the novel attention network. Results show that adding attention to the network improves performance, leading to efficient loop closures, and outperforming an established 3D LiDAR-based place recognition approach. From the ablation study, results indicate that the middle encoder layers have the highest mean performance, while deeper layers are more robust to orientation change. The code is publicly available at https://github.com/Cybonic/AttDLNet
翻译:以LiDAR为基础的地点识别是SLAM和自主飞行器和机器人应用中全球本地化的关键组成部分之一。由于DL成功地从 3D LiDARs 中学习有用信息,地点识别也得益于这一模式,这导致更高度的重新定位和循环闭路检测性能,特别是在条件发生重大变化的环境中。尽管在这一领域取得了进展,但从3D LiDAR数据中提取与不断变化的条件和方向不相适应的适当和有效的描述符仍然是一个尚未解决的挑战。为解决这一问题,这项工作提议了一个新的 3D LiDAR 深层学习网络(名为AttDLNet),在点云和关注网络中使用基于范围的代用代表,并堆叠关注层,有选择地侧重于远程背景和互联关系。拟议的网络在 KITTI 数据集上得到培训和验证,并进行缩略图研究,以评估新的关注网络。结果显示,对网络的注意提高了绩效,导致高效循环关闭,并在3D Net 深层次上执行一个基于 3D 高级LALA 的公开定位,显示,而以最高层次的成绩识别方式是高水平。