Lipschitz网络转换Radiance Field用于逼真的3D场景风格化 (Transforming Radiance Field with Lipschitz Network for Photorealistic 3D Scene Stylization)

Recent advances in 3D scene representation and novel view synthesis have witnessed the rise of Neural Radiance Fields (NeRFs). Nevertheless, it is not trivial to exploit NeRF for the photorealistic 3D scene stylization task, which aims to generate visually consistent and photorealistic stylized scenes from novel views. Simply coupling NeRF with photorealistic style transfer (PST) will result in cross-view inconsistency and degradation of stylized view syntheses. Through a thorough analysis, we demonstrate that this non-trivial task can be simplified in a new light: When transforming the appearance representation of a pre-trained NeRF with Lipschitz mapping, the consistency and photorealism across source views will be seamlessly encoded into the syntheses. That motivates us to build a concise and flexible learning framework namely LipRF, which upgrades arbitrary 2D PST methods with Lipschitz mapping tailored for the 3D scene. Technically, LipRF first pre-trains a radiance field to reconstruct the 3D scene, and then emulates the style on each view by 2D PST as the prior to learn a Lipschitz network to stylize the pre-trained appearance. In view of that Lipschitz condition highly impacts the expressivity of the neural network, we devise an adaptive regularization to balance the reconstruction and stylization. A gradual gradient aggregation strategy is further introduced to optimize LipRF in a cost-efficient manner. We conduct extensive experiments to show the high quality and robust performance of LipRF on both photorealistic 3D stylization and object appearance editing.

翻译：近年来，神经辐射场（NeRF）的发展使得三维场景的表现和新视角综合变得更加便捷。然而，如何将NeRF应用于需要生成视觉一致且逼真的风格化场景的领域内，仍然没有简单的方法。仅仅通过将NeRF和逼真的风格迁移（PST）方法结合将导致视角不一致和掉帧的问题。通过深度分析，我们证明这一复杂的任务可以从另一方面简化：当利用Lipschitz映射转换NeRF的表现表示时，一致性和逼真性将被顺畅地编码到合成中，不再需要引入PST方法。这激励我们构建了一种简洁而灵活的学习框架 LipRF，以Lipschitz映射为基础，可以将任意的2D PST方法转换成适用于3D场景的方法。LipRF首先预训练辐射场以重构3D场景。然后，在每个视角上通过2D PST方法实现将先前学习到的风格加入，然后再学习Lipschitz网络进行风格化。由于Lipschitz条件对于神经网络的表达能力有很大影响，我们设计了一个自适应正则化方法来平衡重构和风格化。进一步引入逐渐梯度聚合策略来以更高效的方式优化LipRF。我们进行了大量实验，以展示LipRF在逼真的3D风格化和物体外观编辑方面的高质量和鲁棒性。