Processing point cloud data is an important component of many real-world systems. As such, a wide variety of point-based approaches have been proposed, reporting steady benchmark improvements over time. We study the key ingredients of this progress and uncover two critical results. First, we find that auxiliary factors like different evaluation schemes, data augmentation strategies, and loss functions, which are independent of the model architecture, make a large difference in performance. The differences are large enough that they obscure the effect of architecture. When these factors are controlled for, PointNet++, a relatively older network, performs competitively with recent methods. Second, a very simple projection-based method, which we refer to as SimpleView, performs surprisingly well. It achieves on par or better results than sophisticated state-of-the-art methods on ModelNet40 while being half the size of PointNet++. It also outperforms state-of-the-art methods on ScanObjectNN, a real-world point cloud benchmark, and demonstrates better cross-dataset generalization. Code is available at https://github.com/princeton-vl/SimpleView.
翻译:云点处理数据是许多现实世界系统的一个重要组成部分。 因此,提出了各种各样的基于点点的方法,报告长期稳步的基准改进。 我们研究了这一进展的关键要素,发现了两个关键结果。 首先,我们发现,独立于模型架构的辅助因素,如不同的评价计划、数据增强战略和损失函数,在性能方面有很大不同。 差异很大,以至于掩盖了结构的影响。 当这些因素被控制时, 一个相对较老的网络PointNet++, 以最近的方法进行竞争。 其次, 一种非常简单的基于投影的方法,我们称之为“简单View”, 表现得令人惊讶。 它比模型Net40 的尖端先进方法取得相同或更好的结果, 却比PointNet++的一半还要高。 它也超越了ScanObjectNNN(一个真实世界点云基准)的先进方法, 并展示了更好的交叉数据总化。 代码可在 https://github.com/prenceton-vl/SpremalyV)。