Generative adversarial networks have been widely used in image synthesis in recent years and the quality of the generated image has been greatly improved. However, the flexibility to control and decouple facial attributes (e.g., eyes, nose, mouth) is still limited. In this paper, we propose a novel approach, called ChildGAN, to generate a child's image according to the images of parents with heredity prior. The main idea is to disentangle the latent space of a pre-trained generation model and precisely control the face attributes of child images with clear semantics. We use distances between face landmarks as pseudo labels to figure out the most influential semantic vectors of the corresponding face attributes by calculating the gradient of latent vectors to pseudo labels. Furthermore, we disentangle the semantic vectors by weighting irrelevant features and orthogonalizing them with Schmidt Orthogonalization. Finally, we fuse the latent vector of the parents by leveraging the disentangled semantic vectors under the guidance of biological genetic laws. Extensive experiments demonstrate that our approach outperforms the existing methods with encouraging results.
翻译:近年来,在图像合成中广泛使用了生成对抗网络,生成图像的质量大为改善,但控制和分辨面部特征(例如眼睛、鼻子、嘴)的灵活性仍然有限。在本文中,我们提议采用名为ChildGAN的新颖方法,根据父母先前的遗传性图像生成儿童图像。主要想法是分离预先训练的生成模型的潜伏空间,用清晰的语义来精确控制儿童图像的表面属性。我们使用面部标志之间的距离作为假标签,通过计算隐性矢量的梯度到假标签来找出相应面部属性中最有影响的语义矢量。此外,我们通过加权不相干的特点和将它们与施密特·奥特戈纳化分解,将语系矢量与父母的潜在矢量相融合,在生物遗传法的指导下利用不相融合的语义矢量。广泛的实验表明,我们的方法超越了现有方法,结果令人鼓舞。