Geometric feature learning for 3D surfaces is critical for many applications in computer graphics and 3D vision. However, deep learning currently lags in hierarchical modeling of 3D surfaces due to the lack of required operations and/or their efficient implementations. In this paper, we propose a series of modular operations for effective geometric feature learning from 3D triangle meshes. These operations include novel mesh convolutions, efficient mesh decimation and associated mesh (un)poolings. Our mesh convolutions exploit spherical harmonics as orthonormal bases to create continuous convolutional filters. The mesh decimation module is GPU-accelerated and able to process batched meshes on-the-fly, while the (un)pooling operations compute features for up/down-sampled meshes. We provide open-source implementation of these operations, collectively termed Picasso. Picasso supports heterogeneous mesh batching and processing. Leveraging its modular operations, we further contribute a novel hierarchical neural network for perceptual parsing of 3D surfaces, named PicassoNet++. It achieves highly competitive performance for shape analysis and scene segmentation on prominent 3D benchmarks. The code, data and trained models are available at https://github.com/EnyaHermite/Picasso.
翻译:3D表面的几何特征学习对于计算机图形学和3D视觉的许多应用至关重要。然而,由于缺乏必需的操作和/或高效的实现,深度学习在层次建模3D表面方面目前存在巨大的差距。在本文中,我们提出了一系列用于从3D三角网格中实现有效的几何特征学习的模块化操作。这些操作包括新颖的网格卷积,高效的网格简化和相关的网格(解)池化。我们的网格卷积利用球谐函数作为正交基础来创建连续的卷积滤波器。网格简化模块是GPU加速的,能够实时处理批量网格,而(解)池化操作计算上/下采样网格的特征。我们提供了这些操作的开源实现,统称为Picasso。Picasso支持异构网格批量处理。借助其模块化操作,我们进一步贡献了一种新颖的层次神经网络,用于感知解析3D表面,命名为PicassoNet++。它在重要的3D基准测试中实现了极具竞争力的形状分析和场景分割性能。代码、数据和训练模型可在https://github.com/EnyaHermite/Picasso找到。