Semantic understanding of 3D point cloud relies on learning models with massively annotated data, which, in many cases, are expensive or difficult to collect. This has led to an emerging research interest in semi-supervised learning (SSL) for 3D point cloud. It is commonly assumed in SSL that the unlabeled data are drawn from the same distribution as that of the labeled ones; This assumption, however, rarely holds true in realistic environments. Blindly using out-of-distribution (OOD) unlabeled data could harm SSL performance. In this work, we propose to selectively utilize unlabeled data through sample weighting, so that only conducive unlabeled data would be prioritized. To estimate the weights, we adopt a bi-level optimization framework which iteratively optimizes a metaobjective on a held-out validation set and a task-objective on a training set. Faced with the instability of efficient bi-level optimizers, we further propose three regularization techniques to enhance the training stability. Extensive experiments on 3D point cloud classification and segmentation tasks verify the effectiveness of our proposed method. We also demonstrate the feasibility of a more efficient training strategy.
翻译:对三维点云的语义理解取决于学习模型,这些模型含有大量附加说明的数据,在许多情况下,这些数据是昂贵的或难以收集的。这导致对三维点云半监督学习(SSL)的研究兴趣的出现。在SSL中,通常假设未贴标签的数据来自与标签的云的相同分布;然而,这一假设在现实环境中很少是真实的。盲目地使用分配外(OOOD)未贴标签的数据可能会损害SSL的性能。在这项工作中,我们提议通过抽样权重选择使用未贴标签的数据,以便仅将有利的无标签数据列为优先事项。为了估计权重,我们采用了双级优化框架,反复优化搁置验证数据集的元目标以及培训组的任务目标。面对高效的双级优化器的不稳定,我们进一步提议三种正规化技术,以加强培训稳定性。关于三维点云分类和分解任务的广泛实验将核查我们拟议方法的有效性。我们还展示了更高效的培训战略的可行性。