Image generation has been heavily investigated in computer vision, where one core research challenge is to generate images from arbitrarily complex distributions with little supervision. Generative Adversarial Networks (GANs) as an implicit approach have achieved great successes in this direction and therefore been employed widely. However, GANs are known to suffer from issues such as mode collapse, non-structured latent space, being unable to compute likelihoods, etc. In this paper, we propose a new unsupervised non-parametric method named mixture of infinite conditional GANs or MIC-GANs, to tackle several GAN issues together, aiming for image generation with parsimonious prior knowledge. Through comprehensive evaluations across different datasets, we show that MIC-GANs are effective in structuring the latent space and avoiding mode collapse, and outperform state-of-the-art methods. MICGANs are adaptive, versatile, and robust. They offer a promising solution to several well-known GAN issues. Code available: github.com/yinghdb/MICGANs.
翻译:在计算机视野中,对图像生成进行了大量调查,其中一项核心研究挑战是在很少监督的情况下,从任意的复杂分布中生成图像。作为一种隐含方法的基因反转网络(GANs)在这方面取得了巨大成功,因此得到了广泛使用。然而,已知GANs受到模式崩溃、非结构化的潜在空间、无法计算可能性等问题的影响。在本文件中,我们提出了一种新的不受监督的非参数方法,名为无限条件GANs或MIC-GANs的混合体,以共同解决若干GAN问题,目的是以熟悉的先前知识生成图像。我们通过对不同的数据集进行全面评估,显示MIC-GANs在构建潜在空间和避免模式崩溃以及超常规状态方法方面是有效的。MICGANs具有适应性、多功能性和强性。它们为一些众所周知的GAN问题提供了很有希望的解决办法。可用的代码: github.com/yinghdb/MICGANs。