Capitalizing on the recent advances in image generation models, existing controllable face image synthesis methods are able to generate high-fidelity images with some levels of controllability, e.g., controlling the shapes, expressions, textures, and poses of the generated face images. However, these methods focus on 2D image generative models, which are prone to producing inconsistent face images under large expression and pose changes. In this paper, we propose a new NeRF-based conditional 3D face synthesis framework, which enables 3D controllability over the generated face images by imposing explicit 3D conditions from 3D face priors. At its core is a conditional Generative Occupancy Field (cGOF) that effectively enforces the shape of the generated face to commit to a given 3D Morphable Model (3DMM) mesh. To achieve accurate control over fine-grained 3D face shapes of the synthesized image, we additionally incorporate a 3D landmark loss as well as a volume warping loss into our synthesis algorithm. Experiments validate the effectiveness of the proposed method, which is able to generate high-fidelity face images and shows more precise 3D controllability than state-of-the-art 2D-based controllable face synthesis methods. Find code and demo at https://keqiangsun.github.io/projects/cgof.
翻译:根据图像生成模型的最新进展,现有可控面相图像合成方法能够产生具有一定可控性水平的高不忠图像,例如控制形状、表达式、质地和生成面图像的形状、表达式、质地和形状。然而,这些方法侧重于2D图像变形模型,这些模型容易在大表情下产生不一致的面像,并带来变化。在本文件中,我们提议一个新的基于 NeRF 的3D 有条件的3D 面相合成框架,通过从 3D 面部前端强加明确的 3D 条件,使生成的面相图像的3D 可控性3D。其核心是一个有条件的 Generalation Occupation 字段(cGOF),可以有效强制生成生成所生成的面相貌形状的形状以承诺给定的 3D Morphable 模型(3DMM) 网格。为了准确控制合成图像的3D脸形状,我们还将3D标志性损失纳入我们的合成算法中。实验验证了拟议方法的有效性,该方法能够生成高不相容度面/可控 2.图像。