Unconditional image generation has recently been dominated by generative adversarial networks (GANs). GAN methods train a generator which regresses images from random noise vectors, as well as a discriminator that attempts to differentiate between the generated images and a training set of real images. GANs have shown amazing results at generating realistic looking images. Despite their success, GANs suffer from critical drawbacks including: unstable training and mode-dropping. The weaknesses in GANs have motivated research into alternatives including: variational auto-encoders (VAEs), latent embedding learning methods (e.g. GLO) and nearest-neighbor based implicit maximum likelihood estimation (IMLE). Unfortunately at the moment, GANs still significantly outperform the alternative methods for image generation. In this work, we present a novel method - Generative Latent Nearest Neighbors (GLANN) - for training generative models without adversarial training. GLANN combines the strengths of IMLE and GLO in a way that overcomes the main drawbacks of each method. Consequently, GLANN generates images that are far better than GLO and IMLE. Our method does not suffer from mode collapse which plagues GAN training and is much more stable. Qualitative results show that GLANN outperforms a baseline consisting of 800 GANs and VAEs on commonly used datasets. Our models are also shown to be effective for training truly non-adversarial unsupervised image translation.
翻译:无条件的图像生成最近以基因对抗网络(GANs)为主。 GAN 方法训练了一种能让随机噪声矢量图像回归的生成器,以及试图区分生成图像和真实图像培训集的区别歧视器。 GANs在生成现实的图像方面表现出惊人的结果。 GANs尽管取得了成功,但还是有严重的缺陷,包括:培训不稳定和模式下降。 GANs 的弱点激发了对替代方法的研究,包括:变异自动编码器(VAE)、潜伏嵌入学习方法(例如GLO)和以近邻为主的隐含最大可能性估计(IMLE ) 。 不幸的是,当时GLANs仍然大大优于生成图像生成的替代方法。 在这项工作中,我们展示了一种新型方法――GLAN Rest Nebbors (GLANN) 的导引力,而没有进行对抗性培训。 GLANQ(GAN) 和GLO(GNA) 常规的解算法则比常规的解算法要好得多。