Learning an effective representation of 3D point clouds requires a good metric to measure the discrepancy between two 3D point sets, which is non-trivial due to their irregularity. Most of the previous works resort to using the Chamfer discrepancy or Earth Mover's distance, but those metrics are either ineffective in measuring the differences between point clouds or computationally expensive. In this paper, we conduct a systematic study with extensive experiments on distance metrics for 3D point clouds. From this study, we propose to use sliced Wasserstein distance and its variants for learning representations of 3D point clouds. In addition, we introduce a new algorithm to estimate sliced Wasserstein distance that guarantees that the estimated value is close enough to the true one. Experiments show that the sliced Wasserstein distance and its variants allow the neural network to learn a more efficient representation compared to the Chamfer discrepancy. We demonstrate the efficiency of the sliced Wasserstein metric and its variants on several tasks in 3D computer vision including training a point cloud autoencoder, generative modeling, transfer learning, and point cloud registration.
翻译:学习3D点云的有效表示要求有一个良好的衡量尺度来衡量两组3D点云之间的差异,因为两组非三维点云的不规律性。先前的工作大多采用Chamfer差异或地球移动器的距离,但这些衡量尺度在测量点云的差异方面要么是无效的,要么是计算成本昂贵的。在本文中,我们进行了一项系统研究,对3D点云的距离指标进行了广泛的实验。从这项研究中,我们提议使用切片瓦塞尔斯坦距离及其变量来测量3D点云的学习表现。此外,我们引入了一种新的算法来估计瓦塞尔斯坦距离,以确保估计值与真实的距离足够接近。实验显示,切片瓦塞斯坦距离及其变量使得神经网络能够学习比Chamfer差异更有效率的表达方式。我们展示了切片瓦塞斯坦指标及其3D计算机愿景的若干任务变量的效率,包括培训点云解解、基因化模型、转移模型和点云登记。