Recent image generation models such as Stable Diffusion have exhibited an impressive ability to generate fairly realistic images starting from a simple text prompt. Could such models render real images obsolete for training image prediction models? In this paper, we answer part of this provocative question by investigating the need for real images when training models for ImageNet classification. Provided only with the class names that were used to build the dataset, we explore the ability of Stable Diffusion to generate synthetic clones of ImageNet and measure how useful these are for training classification models from scratch. We show that with minimal and class-agnostic prompt engineering, ImageNet clones are able to close a large part of the gap between models trained on synthetic images and models trained on real images, on the several standard classification benchmarks that we consider in this study. More importantly, we show that models trained on synthetic images exhibit strong generalization properties and perform on par with models trained on real data for transfer. Project page: https://europe.naverlabs.com/imagenet-sd/