Procedural models (i.e. symbolic programs that output visual data) are a historically-popular method for representing graphics content: vegetation, buildings, textures, etc. They offer many advantages: interpretable design parameters, stochastic variations, high-quality outputs, compact representation, and more. But they also have some limitations, such as the difficulty of authoring a procedural model from scratch. More recently, AI-based methods, and especially neural networks, have become popular for creating graphic content. These techniques allow users to directly specify desired properties of the artifact they want to create (via examples, constraints, or objectives), while a search, optimization, or learning algorithm takes care of the details. However, this ease of use comes at a cost, as it's often hard to interpret or manipulate these representations. In this state-of-the-art report, we summarize research on neurosymbolic models in computer graphics: methods that combine the strengths of both AI and symbolic programs to represent, generate, and manipulate visual data. We survey recent work applying these techniques to represent 2D shapes, 3D shapes, and materials & textures. Along the way, we situate each prior work in a unified design space for neurosymbolic models, which helps reveal underexplored areas and opportunities for future research.
翻译:符号程序模型(即输出可视化数据的符号程序)是表示图形内容的历史上流行的方法之一:植被、建筑、纹理等。它们具有许多优点:可解释的设计参数、随机变化、高质量的输出、紧凑的表示等等。但它们也有一些局限性,例如从头开始创建符号程序模型的难度。最近,AI-based方法,特别是神经网络,已经成为创建图形内容的流行方法。这些技术允许用户直接指定他们想要创建的工件的所需属性(通过示例、约束或目标),而搜索、优化或学习算法负责处理细节。然而,这种使用方便是有代价的,因为往往很难解释或操作这些表示。在这份最新报告中,我们总结了在计算机图形学中神经符号模型的研究:将AI和符号程序的优势相结合,以表示、生成和操作视觉数据的方法。我们概述了近期的工作,将这些技术应用于表示2D形状、3D形状和材料和纹理。在此过程中,我们将每个先前工作置于一个神经符号模型的统一设计空间中,这有助于揭示未开发的领域和未来研究的机会。