The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means for training them. In this work, we propose a general model-agnostic technique for improving the image quality and the distribution fidelity of generated images, obtained by any generative model. Our method, termed BIGRoC (Boosting Image Generation via a Robust Classifier), is based on a post-processing procedure via the guidance of a given robust classifier and without a need for additional training of the generative model. Given a synthesized image, we propose to update it through projected gradient steps over the robust classifier, in an attempt to refine its recognition. We demonstrate this post-processing algorithm on various image synthesis methods and show a significant improvement of the generated images, both quantitatively and qualitatively, on CIFAR-10 and ImageNet. Specifically, BIGRoC improves the image synthesis state of the art on ImageNet 128x128 by 14.81%, attaining an FID score of 2.53 and on 256x256 by 7.87%, achieving an FID of 3.63.
翻译:近些年来,机器学习界对图像合成的兴趣有了显著提高,引进了广泛的深层基因化模型和培训方法。在这项工作中,我们提出一种通过任何基因化模型获得的提高图像质量和分发真实性的一般模型 -- -- 不可知性技术,我们的方法称为BIGRoC(通过一个强力分类器进行图像生成),它基于一个后处理程序,由特定强力分类器提供指导,不需要对基因化模型进行额外培训。我们建议通过预测的关于强力分类器的梯度步骤来更新它,以改进其认识。我们用各种图像合成方法展示了这种后处理算法,并在数量和质量上明显改进了CIFAR-10和图像网络上产生的图像。具体地说,BIGRoC改进了图像网络128x128的图像合成状态,增加了14.81%,实现了2.53的FID分和7.86%的256x256分,实现了3.63的FID。