High-fidelity facial avatar reconstruction from a monocular video is a significant research problem in computer graphics and computer vision. Recently, Neural Radiance Field (NeRF) has shown impressive novel view rendering results and has been considered for facial avatar reconstruction. However, the complex facial dynamics and missing 3D information in monocular videos raise significant challenges for faithful facial reconstruction. In this work, we propose a new method for NeRF-based facial avatar reconstruction that utilizes 3D-aware generative prior. Different from existing works that depend on a conditional deformation field for dynamic modeling, we propose to learn a personalized generative prior, which is formulated as a local and low dimensional subspace in the latent space of 3D-GAN. We propose an efficient method to construct the personalized generative prior based on a small set of facial images of a given individual. After learning, it allows for photo-realistic rendering with novel views and the face reenactment can be realized by performing navigation in the latent space. Our proposed method is applicable for different driven signals, including RGB images, 3DMM coefficients, and audios. Compared with existing works, we obtain superior novel view synthesis results and faithfully face reenactment performance.
翻译:以单视视像为主的高纤维面部骨浆重建是计算机图形和计算机视觉中的一个重要研究问题。最近,神经辐射场(NeRF)展示了令人印象深刻的新观点,取得了令人印象深刻的新观点,并被考虑用于面部骨浆重建。然而,单视视频中复杂的面部动态和缺失的三维信息对忠实面部重建提出了重大挑战。在这项工作中,我们提出了一个基于NeRF的面部骨浆重建的新方法,该方法使用了之前3D-认知的基因变异功能。与现有工程不同,这取决于动态建模的有条件的变形场,我们提议在之前学习个性化基因变异功能,这是在3D-GAN潜藏空间作为局部和低维次空间的子空间制定的。我们提出了一个有效的方法,用来根据特定个人的一小组面部面部面部图像构建个性化基因变异功能。在学习后,通过在暗视空间进行导航,可以实现光真化和面反应。我们提出的方法适用于不同驱动的信号,包括RGB图像、3DMM系数和真实性合成结果以及音频合成。比较现有作品的合成结果。