一种可泛化的光传输三维嵌入方法用于全局光照 (A Generalizable Light Transport 3D Embedding for Global Illumination)

Global illumination (GI) is essential for realistic rendering but remains computationally expensive due to the complexity of simulating indirect light transport. Recent neural methods have mainly relied on per-scene optimization, sometimes extended to handle changes in camera or geometry. Efforts toward cross-scene generalization have largely stayed in 2D screen space, such as neural denoising or G-buffer based GI prediction, which often suffer from view inconsistency and limited spatial understanding. We propose a generalizable 3D light transport embedding that approximates global illumination directly from 3D scene configurations, without using rasterized or path-traced cues. Each scene is represented as a point cloud with geometric and material features. A scalable transformer models global point-to-point interactions to encode these features into neural primitives. At render time, each query point retrieves nearby primitives via nearest-neighbor search and aggregates their latent features through cross-attention to predict the desired rendering quantity. We demonstrate results on diffuse global illumination prediction across diverse indoor scenes with varying layouts, geometry, and materials. The embedding trained for irradiance estimation can be quickly adapted to new rendering tasks with limited fine-tuning. We also present preliminary results for spatial-directional radiance field estimation for glossy materials and show how the normalized field can accelerate unbiased path guiding. This approach highlights a path toward integrating learned priors into rendering pipelines without explicit ray-traced illumination cues.

翻译：全局光照对于实现真实感渲染至关重要，但由于模拟间接光传输的复杂性，其计算成本仍然高昂。近期的神经方法主要依赖于逐场景优化，有时会扩展以处理相机或几何体的变化。实现跨场景泛化的努力大多停留在二维屏幕空间，例如神经降噪或基于G缓冲区的全局光照预测，这些方法通常存在视角不一致性和空间理解有限的问题。我们提出了一种可泛化的三维光传输嵌入方法，能够直接从三维场景配置中近似全局光照，而无需使用光栅化或路径追踪的提示信息。每个场景被表示为具有几何和材质特征的点云。一个可扩展的Transformer模型通过全局点对点交互对这些特征进行建模，将其编码为神经基元。在渲染时，每个查询点通过最近邻搜索检索附近的基元，并通过交叉注意力聚合其潜在特征，以预测所需的渲染量。我们在具有不同布局、几何和材质的多样化室内场景上展示了漫反射全局光照预测的结果。为辐照度估计训练的嵌入可以通过有限的微调快速适应新的渲染任务。我们还展示了针对光泽材质的空间-方向辐射场估计的初步结果，并说明了归一化场如何加速无偏路径引导。这种方法为将学习到的先验知识集成到渲染管线中，而无需显式的光线追踪光照提示，指明了一条路径。