Face editing represents a popular research topic within the computer vision and image processing communities. While significant progress has been made recently in this area, existing solutions: (i) are still largely focused on low-resolution images, (ii) often generate editing results with visual artefacts, or (iii) lack fine-grained control and alter multiple (entangled) attributes at once, when trying to generate the desired facial semantics. In this paper, we aim to address these issues though a novel attribute editing approach called MaskFaceGAN. The proposed approach is based on an optimization procedure that directly optimizes the latent code of a pre-trained (state-of-the-art) Generative Adversarial Network (i.e., StyleGAN2) with respect to several constraints that ensure: (i) preservation of relevant image content, (ii) generation of the targeted facial attributes, and (iii) spatially--selective treatment of local image areas. The constraints are enforced with the help of an (differentiable) attribute classifier and face parser that provide the necessary reference information for the optimization procedure. MaskFaceGAN is evaluated in extensive experiments on the CelebA-HQ, Helen and SiblingsDB-HQf datasets and in comparison with several state-of-the-art techniques from the literature, i.e., StarGAN, AttGAN, STGAN, and two versions of InterFaceGAN. Our experimental results show that the proposed approach is able to edit face images with respect to several facial attributes with unprecedented image quality and at high-resolutions (1024x1024), while exhibiting considerably less problems with attribute entanglement than competing solutions. The source code is made freely available from: https://github.com/MartinPernus/MaskFaceGAN.
翻译:虽然最近在这一领域取得了显著进展,但现有的解决方案是:(一) 仍然主要侧重于低分辨率图像,(二) 经常以视觉手工艺品生成编辑结果,或(三) 在试图生成所需的面部语义学时,缺乏精细控制,同时改变多种(纠缠的)属性。在本文件中,我们的目标是通过一种名为 Mask FaceGAN 的新型属性编辑方法来解决这些问题。提议的方法基于一个优化程序,直接优化预培训(状态-艺术)前(状态-艺术)的隐性代码,(二) 生成低分辨率图像,(二) 经常生成视觉手工艺制品的编辑结果,或(三) 在试图生成所需的面部语义语义语义描述时,(二) 在空间上对本地图像区域进行选择性的处理。通过一种(可理解的)可分级和面面面面面法方法,为优化程序提供必要的参考信息。Mask FasfaceGAN 高级图像网络(即StyGAN) 、Seal-Alieal-Q 和Salial-G Salib-H 的多项Salial-G 演示数据-G-G