Scaling artist-designed meshes to high triangle numbers remains challenging for autoregressive generative models. Existing transformer-based methods suffer from long-sequence bottlenecks and limited quantization resolution, primarily due to the large number of tokens required and constrained quantization granularity. These issues prevent faithful reproduction of fine geometric details and structured density patterns. We introduce MeshMosaic, a novel local-to-global framework for artist mesh generation that scales to over 100K triangles--substantially surpassing prior methods, which typically handle only around 8K faces. MeshMosaic first segments shapes into patches, generating each patch autoregressively and leveraging shared boundary conditions to promote coherence, symmetry, and seamless connectivity between neighboring regions. This strategy enhances scalability to high-resolution meshes by quantizing patches individually, resulting in more symmetrical and organized mesh density and structure. Extensive experiments across multiple public datasets demonstrate that MeshMosaic significantly outperforms state-of-the-art methods in both geometric fidelity and user preference, supporting superior detail representation and practical mesh generation for real-world applications.
翻译:将艺术家设计的网格扩展到高三角形数量对于自回归生成模型仍然具有挑战性。现有的基于Transformer的方法受限于长序列瓶颈和有限的量化分辨率,这主要源于所需的大量令牌和受限的量化粒度。这些问题阻碍了对精细几何细节和结构化密度模式的忠实再现。我们提出了MeshMosaic,一种新颖的局部到全局框架,用于艺术家网格生成,可扩展至超过10万个三角形——显著超越了先前通常仅能处理约8千个面的方法。MeshMosaic首先将形状分割为补丁,自回归地生成每个补丁,并利用共享边界条件来促进相邻区域之间的连贯性、对称性和无缝连接。该策略通过单独量化补丁,增强了高分辨率网格的可扩展性,从而产生更对称、更有组织的网格密度和结构。在多个公共数据集上的广泛实验表明,MeshMosaic在几何保真度和用户偏好方面均显著优于最先进的方法,支持卓越的细节表示和实际应用中的实用网格生成。