Semantic face editing of real world facial images is an important application of generative models. Recently, multiple works have explored possible techniques to generate such modifications using the latent structure of pre-trained GAN models. However, such approaches often require training an encoder network and that is typically a time-consuming and resource intensive process. A possible alternative to such a GAN-based architecture can be styleALAE, a latent-space based autoencoder that can generate photo-realistic images of high quality. Unfortunately, the reconstructed image in styleALAE does not preserve the identity of the input facial image. This limits the application of styleALAE for semantic face editing of images with known identities. In our work, we use a recent advancement in one-shot domain adaptation to address this problem. Our work ensures that the identity of the reconstructed image is the same as the given input image. We further generate semantic modifications over the reconstructed image by using the latent space of the pre-trained styleALAE model. Results show that our approach can generate semantic modifications on any real world facial image while preserving the identity.
翻译:真实世界面部图像的语义面部编辑是基因模型的一个重要应用。 最近,多部作品探索了利用预先训练过的GAN模型的潜在结构进行这种修改的可能技术。 但是,这类方法往往需要培训编码器网络,通常是一个耗时和资源密集的过程。 以GAN为基础的结构的一个可能的替代方法可以是StyALAE,一个基于潜空的自动编码器,可以生成高质量的照片现实图像。 不幸的是,SysteALAE中重建后的图像无法保存输入面部图像的特性。这限制了StyALE用于对已知身份图像的语义面部编辑的应用。 在我们的工作中,我们利用最近的一次性域适应进展来解决这个问题。我们的工作确保重建后的图像的特性与给定的输入图像相同。我们还利用预先训练过的SyALAE模型的潜在空间对重建后的图像产生语义上的修改。结果显示,我们的方法可以在保存身份的同时对任何真实世界面部图像产生语义上的修改。