Point cloud data plays an essential role in robotics and self-driving applications. Yet, annotating point cloud data is time-consuming and nontrivial while they enable learning discriminative 3D representations that empower downstream tasks, such as classification and segmentation. Recently, contrastive learning-based frameworks have shown promising results for learning 3D representations in a self-supervised manner. However, existing contrastive learning methods cannot precisely encode and associate structural features and search the higher dimensional augmentation space efficiently. In this paper, we present CLR-GAM, a novel contrastive learning-based framework with Guided Augmentation (GA) for efficient dynamic exploration strategy and Guided Feature Mapping (GFM) for similar structural feature association between augmented point clouds. We empirically demonstrate that the proposed approach achieves state-of-the-art performance on both simulated and real-world 3D point cloud datasets for three different downstream tasks, i.e., 3D point cloud classification, few-shot learning, and object part segmentation.
翻译:点云数据在机器人和自我驱动应用中发挥着必不可少的作用。然而,点云数据说明点云数据既耗时又非三维性,同时有助于学习具有歧视性的三维表现方式,赋予下游任务,例如分类和分化等权力。最近,对比式学习框架显示,以自我监督的方式学习三维表现方式有希望的结果。然而,现有的对比式学习方法无法精确地编码和结合结构特征,也无法有效搜索更高维度增强空间。在本文件中,我们介绍了CLR-GAM,这是一个与方向增强(GA)的对比式学习基础框架,用于高效动态探索战略,而向导地貌绘图(GM)则用于增强点云之间的类似结构特征联系。我们从经验上证明,拟议的方法在模拟和实际世界的三维点云数据集上都实现了三种不同下游任务的最新性表现,即3D点云分类、微小的学习和对象部分分割。</s>