Rendering scenes with a high-quality human face from arbitrary viewpoints is a practical and useful technique for many real-world applications. Recently, Neural Radiance Fields (NeRF), a rendering technique that uses neural networks to approximate classical ray tracing, have been considered as one of the promising approaches for synthesizing novel views from a sparse set of images. We find that NeRF can render new views while maintaining geometric consistency, but it does not properly maintain skin details, such as moles and pores. These details are important particularly for faces because when we look at an image of a face, we are much more sensitive to details than when we look at other objects. On the other hand, 3D Morpable Models (3DMMs) based on traditional meshes and textures can perform well in terms of skin detail despite that it has less precise geometry and cannot cover the head and the entire scene with background. Based on these observations, we propose a method to use both NeRF and 3DMM to synthesize a high-fidelity novel view of a scene with a face. Our method learns a Generative Adversarial Network (GAN) to mix a NeRF-synthesized image and a 3DMM-rendered image and produces a photorealistic scene with a face preserving the skin details. Experiments with various real-world scenes demonstrate the effectiveness of our approach. The code will be available on https://github.com/showlab/headshot .
翻译:从任意的角度看,用高品质的人类面孔描绘场景是许多真实世界应用的一种实用和有用的技术。 最近,神经辐射场(NERF)是一种利用神经网络来大致古典射线追踪的技术,被视作一种很有希望的方法,可以将来自一幅稀疏图像的新观点合成成一幅。我们发现NERF既可以提供新的观点,同时又保持几何一致性,但并不适当保持皮肤细节,例如摩尔和孔。这些细节对于面部来说尤其重要,因为当我们看一张脸时,我们比看其他物体时更敏感地了解细节。另一方面,基于传统线性光学和纹理的3DMMM模型(3DMMMMM)可以很好地在皮肤细节方面表现得更好,尽管它不那么精确,而且不能以背景来覆盖头部和整个场景。基于这些观察,我们建议一种方法,即使用NERF和3DMMMM, 来合成一张面的高纤维新面面图像。我们的方法是真实的AD-RVS-Simalimalmagraphical 和图像的图像模型模型模型模型。我们用一个真实的图像模型模型模型模型和图像模型模型模型模型的模型模型模型模型模型模型模型模型模型模型和图像模型的模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型模型。