Convolutional neural networks (CNNs) have made great breakthroughs in 2D computer vision. However, their irregular structure makes it hard to harness the potential of CNNs directly on meshes. A subdivision surface provides a hierarchical multi-resolution structure, in which each face in a closed 2-manifold triangle mesh is exactly adjacent to three faces. Motivated by these two observations, this paper presents SubdivNet, an innovative and versatile CNN framework for 3D triangle meshes with Loop subdivision sequence connectivity. Making an analogy between mesh faces and pixels in a 2D image allows us to present a mesh convolution operator to aggregate local features from nearby faces. By exploiting face neighborhoods, this convolution can support standard 2D convolutional network concepts, e.g. variable kernel size, stride, and dilation. Based on the multi-resolution hierarchy, we make use of pooling layers which uniformly merge four faces into one and an upsampling method which splits one face into four. Thereby, many popular 2D CNN architectures can be easily adapted to process 3D meshes. Meshes with arbitrary connectivity can be remeshed to have Loop subdivision sequence connectivity via self-parameterization, making SubdivNet a general approach. Extensive evaluation and various applications demonstrate SubdivNet's effectiveness and efficiency.
翻译:革命神经网络(CNNs)在 2D 计算机视野中取得了巨大的突破。 然而,它们的非常规结构使得很难直接利用CNN在 meshes 上的潜力。 一个子分层表面提供了一种分层的多分辨率结构,在这个结构中,每个面部在封闭的2张三角三角网中,都与三个面孔完全相邻。在这两个观察的推动下,本文展示了SubdivNet,一个创新和多功能的CNN3三角网际网际网际网际框架,与Loop子相连接。在 2D 图像的网形脸和像素之间做一个类比,让我们能够展示一个网形图面和像的像。通过利用相邻的相邻区块,这种组合可以支持标准的2D革命网络网络概念,例如变形的内心、变形和变形形形形形形形形形形形形形形的网络。根据多分辨率的等级,我们使用将4个面形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形色形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形形色色色色色色色色色色