Recent learning-based LiDAR odometry methods have demonstrated their competitiveness. However, most methods still face two substantial challenges: 1) the 2D projection representation of LiDAR data cannot effectively encode 3D structures from the point clouds; 2) the needs for a large amount of labeled data for training limit the application scope of these methods. In this paper, we propose a self-supervised LiDAR odometry method, dubbed SelfVoxeLO, to tackle these two difficulties. Specifically, we propose a 3D convolution network to process the raw LiDAR data directly, which extracts features that better encode the 3D geometric patterns. To suit our network to self-supervised learning, we design several novel loss functions that utilize the inherent properties of LiDAR point clouds. Moreover, an uncertainty-aware mechanism is incorporated in the loss functions to alleviate the interference of moving objects/noises. We evaluate our method's performances on two large-scale datasets, i.e., KITTI and Apollo-SouthBay. Our method outperforms state-of-the-art unsupervised methods by 27%/32% in terms of translational/rotational errors on the KITTI dataset and also performs well on the Apollo-SouthBay dataset. By including more unlabelled training data, our method can further improve performance comparable to the supervised methods.
翻译:最近基于学习的LiDAR odology方法显示了其竞争力。然而,大多数方法仍面临两大挑战:(1) 2D 预测显示LIDAR数据无法从点云中有效地编码 3D 结构;(2) 培训需要大量标签数据限制了这些方法的应用范围。在本文件中,我们提议了一种自我监督的LIDAR odology方法,称为SelfVoxeLO,以解决这两个难题。具体地说,我们提议建立一个3D Convolution网络,直接处理原始的LIDAR数据,从而提取出更好地编码 3D 几何模式的特征。为了适合我们的网络进行自我监督学习,我们设计了若干新的损失功能,利用LIDAR 点云的内在特性。此外,我们将不确定性识别机制纳入损失功能,以缓解移动对象/Novise的干扰。我们评估了两个大型数据集的性能,即KITTI和阿波罗-南BayBay。我们的方法在27/32的不可靠的数据转换方法中超越了状态的可比较性差性数据,包括以27/32的SIMBSD数据系统进行更精确的数据转换方法。