学习深强化学习政策,在经过培训的用于语义时代操纵的预培训GAN的前沿空间学习 (Learning a Deep Reinforcement Learning Policy Over the Latent Space of a Pre-trained GAN for Semantic Age Manipulation)

Learning a disentangled representation of the latent space has become one of the most fundamental problems studied in computer vision. Recently, many Generative Adversarial Networks (GANs) have shown promising results in generating high fidelity images. However, studies to understand the semantic layout of the latent space of pre-trained models are still limited. Several works train conditional GANs to generate faces with required semantic attributes. Unfortunately, in these attempts, the generated output is often not as photo-realistic as the unconditional state-of-the-art models. Besides, they also require large computational resources and specific datasets to generate high fidelity images. In our work, we have formulated a Markov Decision Process (MDP) over the latent space of a pre-trained GAN model to learn a conditional policy for semantic manipulation along specific attributes under defined identity bounds. Further, we have defined a semantic age manipulation scheme using a locally linear approximation over the latent space. Results show that our learned policy samples high fidelity images with required age alterations, while preserving the identity of the person.

翻译：学习潜伏空间的分解表达方式已成为计算机视觉中研究的最根本问题之一。最近,许多创世对立网络(GANs)在生成高忠诚图像方面显示出令人乐观的结果。然而,为了解预先训练的模型潜在空间的语义布局而进行的研究仍然有限。几项工作训练有条件的GANs, 以生成必要的语义属性的面孔。不幸的是,在这些尝试中, 产生的输出往往没有像无条件最新模型那样真实的图像。此外, 它们还需要大量的计算资源和具体的数据集来生成高忠诚图像。在我们的工作中, 我们为预先训练过的GAN模型的潜在空间制定了Markov决定程序(MDP ), 以学习一种有条件的政策, 用于在特定身份界限下的特定属性进行语义性操纵。此外, 我们定义了一种语义年龄操纵计划, 使用局部的线性近近于暗处。结果显示, 我们所学的政策对高忠诚图像进行了必要的年龄改变, 同时保留了个人的身份。