Reconstructing high-fidelity 3D facial texture from a single image is a challenging task since the lack of complete face information and the domain gap between the 3D face and 2D image. The most recent works tackle facial texture reconstruction problem by applying either generation-based or reconstruction-based methods. Although each method has its own advantage, none of them is capable of recovering a high-fidelity and re-renderable facial texture, where the term 're-renderable' demands the facial texture to be spatially complete and disentangled with environmental illumination. In this paper, we propose a novel self-supervised learning framework for reconstructing high-quality 3D faces from single-view images in-the-wild. Our main idea is to first utilize the prior generation module to produce a prior albedo, then leverage the detail refinement module to obtain detailed albedo. To further make facial textures disentangled with illumination, we present a novel detailed illumination representation which is reconstructed with the detailed albedo together. We also design several regularization loss functions on both the albedo side and illumination side to facilitate the disentanglement of these two factors. Finally, thanks to the differentiable rendering technique, our neural network can be efficiently trained in a self-supervised manner. Extensive experiments on challenging datasets demonstrate that our framework substantially outperforms state-of-the-art approaches in both qualitative and quantitative comparisons.
翻译:从单一图像中重建高纤维化 3D 面部纹理是一项具有挑战性的任务,因为缺乏完整的面部信息以及3D脸和2D图像之间的域差。最近的工作通过采用以代为基础的或以重建为基础的方法来解决面部纹理重建问题。虽然每种方法都有其自身的优势,但它们没有一个能够从高纤维化和可再现的面部纹理中恢复高纤维化和可再现的面部纹理。为了进一步使面部纹理与不洁相混淆,我们提出了一个与环境照明相混淆的空间完整的新的详细描述。在本文中,我们提出了一个全新的自我监督的比较学习框架,用于重建三D脸部的高质量图像。我们的主要想法是首先利用前一代的模范来生成一个先前的升温,然后利用细节改进模块来获取详细的高温化。为了进一步使面部纹理脱落,我们展示了与详细平面图一起重建的面部透析。我们还设计了几个自我监督的自我监督的学习框架。我们设计了一些自我监督的自我校正损失功能, 最终在高压化的网络技术上展示了我们不同的自我变形,可以使系统变形变形变形变形的自我变形的自我变形。