We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for fidelity using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128$\times$128, 4.59 on ImageNet 256$\times$256, and 7.72 on ImageNet 512$\times$512, and we match BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. Finally, we find that classifier guidance combines well with upsampling diffusion models, further improving FID to 3.85 on ImageNet 512$\times$512. We release our code at https://github.com/openai/guided-diffusion.
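To make the classifier-guidance idea concrete, the following is a minimal PyTorch sketch of one guided reverse-diffusion step: the sampler's predicted mean is shifted by the (scaled) classifier gradient, $\hat{\mu} = \mu + s\,\Sigma\,\nabla_x \log p(y \mid x_t)$. The function name `guided_mean`, the noise-aware classifier interface `classifier(x_t, t)`, and the `mean`/`variance` arguments are illustrative assumptions about the sampler's interface, not the released code.

```python
import torch

def guided_mean(mean, variance, x_t, t, y, classifier, scale=1.0):
    """Classifier guidance for a single reverse-diffusion step.

    Shifts the model's predicted mean toward samples the classifier
    assigns to class y:  mu_hat = mu + s * Sigma * grad_x log p(y | x_t).
    `classifier(x_t, t)` is a hypothetical noise-aware classifier that
    returns per-class logits for the noisy input x_t at timestep t.
    """
    with torch.enable_grad():
        x_in = x_t.detach().requires_grad_(True)
        logits = classifier(x_in, t)
        log_probs = torch.log_softmax(logits, dim=-1)
        # Sum over the batch of the log-probability of each target class;
        # its gradient w.r.t. x_in gives grad_x log p(y | x_t) per sample.
        selected = log_probs[range(len(y)), y].sum()
        grad = torch.autograd.grad(selected, x_in)[0]
    # A larger `scale` trades diversity for fidelity, as described above.
    return mean + scale * variance * grad
```

In use, a sampler would call this in place of the unguided mean before drawing $x_{t-1} \sim \mathcal{N}(\hat{\mu}, \Sigma)$; guidance adds only one classifier forward/backward pass per step, which is why the method is compute-efficient.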