In this work, we show the generative capability of an image classifier network by synthesizing high-resolution, photo-realistic, and diverse images at scale. The overall methodology, called Synthesize-It-Classifier (STIC), does not require an explicit generator network to estimate the density of the data distribution and sample images from that, but instead uses the classifier's knowledge of the boundary to perform gradient ascent w.r.t. class logits and then synthesizes images using Gram Matrix Metropolis Adjusted Langevin Algorithm (GRMALA) by drawing on a blank canvas. During training, the classifier iteratively uses these synthesized images as fake samples and re-estimates the class boundary in a recurrent fashion to improve both the classification accuracy and quality of synthetic images. The STIC shows the mixing of the hard fake samples (i.e. those synthesized by the one hot class conditioning), and the soft fake samples (which are synthesized as a convex combination of classes, i.e. a mixup of classes) improves class interpolation. We demonstrate an Attentive-STIC network that shows an iterative drawing of synthesized images on the ImageNet dataset that has thousands of classes. In addition, we introduce the synthesis using a class conditional score classifier (Score-STIC) instead of a normal image classifier and show improved results on several real-world datasets, i.e. ImageNet, LSUN, and CIFAR 10.
翻译:在这项工作中,我们通过将高分辨率、照片现实化和各种规模图像合成,展示图像分类网络的基因化能力。总体方法叫做“合成图像-IST-分类仪(STIC)”,它不需要一个明确的生成器网络来估计数据分布和从中样本图像的密度,而是使用分类器对边界的了解来进行梯度升升(r.r.t.)类登录,然后通过在空白画布上绘制图像,合成图像。在培训期间,分类器反复使用这些合成图像作为假样品,并用经常性的方式重新估计分类边界,以提高合成图像的分类准确性和质量。科学、创新和研究中心展示了硬假样品的混合(即由一个热级调节器合成的样本合成),以及软假样品(合成为各类的组合,即各类混合的组合),它改进了类间图像。在S类中,我们展示了一个S级的升级的分类,在S类中,我们展示了一个S级的升级的分类,在S类中,我们展示了一个S级的升级的合成模型。