Previous portrait image generation methods roughly fall into two categories: 2D GANs and 3D-aware GANs. 2D GANs can generate high fidelity portraits but with low view consistency. 3D-aware GAN methods can maintain view consistency but their generated images are not locally editable. To overcome these limitations, we propose FENeRF, a 3D-aware generator that can produce view-consistent and locally-editable portrait images. Our method uses two decoupled latent codes to generate corresponding facial semantics and texture in a spatial aligned 3D volume with shared geometry. Benefiting from such underlying 3D representation, FENeRF can jointly render the boundary-aligned image and semantic mask and use the semantic mask to edit the 3D volume via GAN inversion. We further show such 3D representation can be learned from widely available monocular image and semantic mask pairs. Moreover, we reveal that joint learning semantics and texture helps to generate finer geometry. Our experiments demonstrate that FENeRF outperforms state-of-the-art methods in various face editing tasks.
翻译:先前的肖像生成方法大致可分为两类: 2D GANs 和 3D-aware GANs 。 2D GANs 能够生成高度忠诚的肖像,但视觉一致性较低。 3D-aware GAN 方法可以保持视图一致性, 但生成的图像无法在本地编辑。 为了克服这些限制, 我们提议 FENeRF, 3D-aware 生成器, 3D 生成可生成视觉一致和可本地编辑的肖像图像。 我们的方法使用两种分解的潜在代码, 在空间对齐的 3D 卷中生成相应的面部语义和纹理。 从这种基底部的 3D 表示法中获益, FENeRF 能够联合制作边界对齐图像和语义遮罩, 并使用语义遮罩通过 GAN 转换来编辑 3D 音量。 我们进一步显示, 3D 代表器可以从广泛可用的单形图像和语义遮罩中学习。 此外, 我们显示, 联合学习语义和纹有助于生成细度的几何测量。 我们的实验显示, 我们的实验显示 FENR 超越了各种面编辑任务中的状态。