Pairwise point cloud registration is a critical task for many applications, which heavily depends on finding the right correspondences from the two point clouds. However, the low overlap between the input point clouds makes the registration prone to fail, leading to mistaken overlapping and mismatched correspondences, especially in scenes where non-overlapping regions contain similar structures. In this paper, we present a unified bird's-eye view (BEV) model for jointly learning of 3D local features and overlap estimation to fulfill the pairwise registration and loop closure. Feature description based on BEV representation is performed by a sparse UNet-like network, and the 3D keypoints are extracted by a detection head for 2D locations and a regression head for heights, respectively. For overlap detection, a cross-attention module is applied for interacting contextual information of the input point clouds, followed by a classification head to estimate the overlapping region. We evaluate our unified model extensively on the KITTI dataset and Apollo-SouthBay dataset. The experiments demonstrate that our method significantly outperforms existing methods on overlap prediction, especially in scenes with small overlaps. The registration precision also achieves top performance on both datasets in terms of translation and rotation errors. Source codes will be available soon.
翻译:Pairwith点云登记是许多应用的关键任务,这在很大程度上取决于从两个点云中找到正确的对应信息。然而,输入点云之间的低重叠使得登记容易失败,导致错误的重叠和不匹配的对应信息,特别是在非重叠区域包含类似结构的场景中。在本文中,我们展示了一个统一的鸟眼视图模型,用于共同学习3D本地特征和重叠估计,以完成对称登记和循环结束。基于 BEV 表示方式的特征描述由一个稀少的类似UNet 的网络进行,3D 关键点由2D 位置的探测头和高度的回归头分别提取。对于重叠检测,一个交叉注意模块用于对输入点云的背景信息进行互动,然后用一个分类头来估计重叠区域。我们广泛评价了我们关于KITTI数据集和Apollo-SouthBay数据集的统一模型。实验表明,我们的方法大大超出了现有重叠预测方法,特别是在与小重叠的场景中。对于数据代码的翻译也将很快实现最高性能。</s>