Implicit generative models have been widely employed to model 3D data and have recently proven to be successful in encoding and generating high-quality 3D shapes. This work builds upon these models and alleviates current limitations by presenting the first implicit generative model that facilitates the generation of complex 3D shapes with rich internal geometric details. To achieve this, our model uses unsigned distance fields to represent nested 3D surfaces allowing learning from non-watertight mesh data. We propose a transformer-based autoregressive model for 3D shape generation that leverages context-rich tokens from vector quantized shape embeddings. The generated tokens are decoded into an unsigned distance field which is rendered into a novel 3D shape exhibiting a rich internal structure. We demonstrate that our model achieves state-of-the-art point cloud generation results on popular classes of 'Cars', 'Planes', and 'Chairs' of the ShapeNet dataset. Additionally, we curate a dataset that exclusively comprises shapes with realistic internal details from the `Cars' class of ShapeNet and demonstrate our method's efficacy in generating these shapes with internal geometry.
翻译:隐式生成模型已被广泛应用于建模三维数据,并最近被证明在编码和生成高质量三维形状方面非常成功。本文基于这些模型,通过提出第一个隐式生成模型,可以方便地生成具有丰富内部几何细节的复杂三维形状,从而缓解目前的限制。为了实现这一目标,我们的模型使用无符号距离场来表示嵌套的三维表面,从而允许从非封闭网格数据中进行学习。我们提出了一种基于变压器的自回归模型,用于三维形状生成,利用来自向量量化形状嵌入的上下文丰富的标记。生成的标记被解码为一个无符号距离场,然后渲染成一种新颖的三维形状,展示出丰富的内部结构。我们证明了我们的模型在ShapeNet数据集的“汽车”、“飞机”和“椅子”等热门类别的点云生成结果方面实现了最先进的成果。此外,我们还策划了一个专门包括来自ShapeNet的“汽车”类别的具有实际内部细节的形状的数据集,并展示了我们的方法在生成这些具有内部几何的形状方面的有效性。