PointNet++ is one of the most influential neural architectures for point cloud understanding. Although the accuracy of PointNet++ has been largely surpassed by recent networks such as PointMLP and Point Transformer, we find that a large portion of the performance gain is due to improved training strategies, i.e. data augmentation and optimization techniques, and increased model sizes rather than architectural innovations. Thus, the full potential of PointNet++ has yet to be explored. In this work, we revisit the classical PointNet++ through a systematic study of model training and scaling strategies, and offer two major contributions. First, we propose a set of improved training strategies that significantly improve PointNet++ performance. For example, we show that, without any change in architecture, the overall accuracy (OA) of PointNet++ on ScanObjectNN object classification can be raised from 77.9\% to 86.1\%, even outperforming state-of-the-art PointMLP. Second, we introduce an inverted residual bottleneck design and separable MLPs into PointNet++ to enable efficient and effective model scaling and propose PointNeXt, the next version of PointNets. PointNeXt can be flexibly scaled up and outperforms state-of-the-art methods on both 3D classification and segmentation tasks. For classification, PointNeXt reaches an overall accuracy of $87.7\%$ on ScanObjectNN, surpassing PointMLP by $2.3\%$, while being $10 \times$ faster in inference. For semantic segmentation, PointNeXt establishes a new state-of-the-art performance with $74.9\%$ mean IoU on S3DIS (6-fold cross-validation), being superior to the recent Point Transformer. The code and models are available at https://github.com/guochengqian/pointnext.
翻译:点网++ 是了解点云的最有影响力的神经结构之一。 虽然点网++ 的准确性已被最近一些网络, 如PpointMLP 和点变换器, 大大超过点网++ 的准确性, 但我们发现, 大部分的性能增益是由于改进了培训战略, 即数据增强和优化技术, 以及模型规模的扩大, 而不是建筑创新。 因此, 点网+++ 的全部潜力还有待探索。 在这项工作中, 我们通过系统研究模式培训和规模战略, 重新审视经典点网+++。 首先, 我们提出了一套改进的训练战略, 大大改进了点网++的性能。 例如, 我们发现, 在架构中没有任何变化, 点网++G( OA) 的总体精度( ScampObject Nation) 的准确性能从77. 9 ⁇ 提高到86.1 ⁇ , 甚至低于标准值的状态。 第二, 我们引入了一种倒置的残余的瓶价- $ 和 QPP+++,,,, 在点网 上, 可以进行高效的模型的升级的升级的升级的 Sral- dealxx 。