Despite significant advances in image-to-image (I2I) translation with generative adversarial networks (GANs), it remains challenging to effectively translate an image to a set of diverse images in multiple target domains using a single pair of generator and discriminator. Existing I2I translation methods adopt multiple domain-specific content encoders for different domains, where each domain-specific content encoder is trained with images from the same domain only. Nevertheless, we argue that the content (domain-invariance) features should be learned from images among all of the domains. Consequently, each domain-specific content encoder of existing schemes fails to extract the domain-invariant features efficiently. To address this issue, we present a flexible and general SoloGAN model for efficient multimodal I2I translation among multiple domains with unpaired data. In contrast to existing methods, the SoloGAN algorithm uses a single projection discriminator with an additional auxiliary classifier and shares the encoder and generator for all domains. Consequently, the SoloGAN can be trained effectively with images from all domains such that the domain-invariance content representation can be efficiently extracted. Qualitative and quantitative results over a wide range of datasets against several counterparts and variants of the SoloGAN demonstrate the merits of the method, especially for challenging I2I translation datasets, i.e., datasets involving extreme shape variations or need to keep the complex backgrounds unchanged after translations. Furthermore, we demonstrate the contribution of each component in SoloGAN by ablation studies.
翻译:尽管在图像到图像翻译(I2I)方面有显著进步,并配有基因对抗网络(GANs),但将图像有效地将图像转换成多目标域的一组不同图像仍然具有挑战性,使用单一的生成器和导师。现有的I2I翻译方法为不同域采用了多域特定内容编码器,每个特定域的内容编码器只用同一域的图像进行培训。然而,我们主张,内容(常年差异)特征应当从所有域间的所有域间无变数中学习。因此,每个现有方案的具体域内内容编码器无法有效地提取域内变数特征。为了解决这一问题,我们提出了一个灵活和通用的 SoloGAN 模型,用于在多域内使用未配置数据的多域间高效的多元式I2I翻译。与现有方法相比,SoloGAN 算法使用单一的投影解码器,并共享所有域内的编码器和生成器。因此,SoloGAN可以有效地从所有域内的所有域内的所有域图象中进行培训,以便有效地提取域内域内域内变变数内容的域内变数内容,特别是变数的数值,在数据转换中可以有效展示数据格式中反映数据的变量的每个数据的变数。