The conditional generative adversarial network (cGAN) is a powerful tool of generating high-quality images, but existing approaches mostly suffer unsatisfying performance or the risk of mode collapse. This paper presents Omni-GAN, a variant of cGAN that reveals the devil in designing a proper discriminator for training the model. The key is to ensure that the discriminator receives strong supervision to perceive the concepts and moderate regularization to avoid collapse. Omni-GAN is easily implemented and freely integrated with off-the-shelf encoding methods (e.g., implicit neural representation, INR). Experiments validate the superior performance of Omni-GAN and Omni-INR-GAN in a wide range of image generation and restoration tasks. In particular, Omni-INR-GAN sets new records on the ImageNet dataset with impressive Inception scores of 262.85 and 343.22 for the image sizes of 128 and 256, respectively, surpassing the previous records by 100+ points. Moreover, leveraging the generator prior, Omni-INR-GAN can extrapolate low-resolution images to arbitrary resolution, even up to x60+ higher resolution. Code is available.
翻译:有条件的基因对抗网络(cGAN)是生成高质量图像的有力工具,但现有方法大多不令人满意地表现或出现模式崩溃的风险。本文展示了Omni-GAN,这是CGAN的一种变体,在设计一个适当的模型时显示魔鬼在设计一个适当的歧视者以培训模型方面显示魔鬼。关键是确保歧视者得到强有力的监督,以了解概念和适度规范,避免崩溃。Omni-GAN很容易实施,并自由地与现成编码方法(例如,隐含神经代表、IRN)融合。实验验证了Omni-GAN和Omni-INR-GAN在广泛的图像生成和恢复任务方面的优异性表现。特别是,Omni-INR-GAN在图像网络数据集上设置了新的记录,其图像尺寸分别为128和256,分别达到令人印象深刻的262.85分和343.22分,比先前的记录高出100分。此外,利用发电机之前的Omni-INR-GAN可以将低分辨率图像外加任意分辨率,甚至可得到xdeal+分辨率。