Diffusion Models Generate Images Like Painters: an Analytical Theory of Outline First, Details Later

How do diffusion generative models convert pure noise into meaningful images? We argue that generation involves first committing to an outline, and then to finer and finer details. The corresponding reverse diffusion process can be modeled by dynamics on a (time-dependent) high-dimensional landscape full of Gaussian-like modes, which makes the following predictions: (i) individual trajectories tend to be very low-dimensional; (ii) scene elements that vary more within training data tend to emerge earlier; and (iii) early perturbations substantially change image content more often than late perturbations. We show that the behavior of a variety of trained unconditional and conditional diffusion models like Stable Diffusion is consistent with these predictions. Finally, we use our theory to search for the latent image manifold of diffusion models, and propose a new way to generate interpretable image variations. Our viewpoint suggests that generation by GANs and by diffusion models has unexpected similarities.
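Prediction (ii) can be illustrated with a toy calculation. The sketch below is not the authors' code; it makes several simplifying assumptions that are not stated in the abstract: a single Gaussian mode with independent coordinates of variance `lam`, a forward process x_t = alpha_t x_0 + sigma_t eps with a cosine schedule (alpha_t = cos(pi t / 2), sigma_t = sin(pi t / 2)), and deterministic (probability-flow) sampling. Under those assumptions, the denoised estimate E[x_0 | x_t] along a sampling trajectory equals the coordinate's final value multiplied by alpha_t sqrt(lam) / sqrt(alpha_t^2 lam + sigma_t^2), so the fraction already "committed" depends only on the time and on the variance. The function names and the example variances are hypothetical.

```python
import numpy as np

def commit_fraction(t, lam):
    """Fraction of a coordinate's final value already present in the
    denoised estimate E[x_0 | x_t] at time t (t=1 is noise, t=0 is image).

    Assumes a single Gaussian mode with variance `lam` and a cosine schedule.
    """
    alpha = np.cos(0.5 * np.pi * t)  # assumed signal scale
    sigma = np.sin(0.5 * np.pi * t)  # assumed noise scale
    return alpha * np.sqrt(lam) / np.sqrt(alpha**2 * lam + sigma**2)

def emergence_time(lam, threshold=0.5, n_steps=1001):
    """Largest t (earliest moment of reverse diffusion) at which the
    coordinate has reached `threshold` of its final value."""
    ts = np.linspace(1.0, 0.0, n_steps)      # reverse time: noise -> image
    frac = commit_fraction(ts, lam)
    return ts[np.argmax(frac >= threshold)]  # first index where it holds

# Hypothetical variances, from "outline-like" (large) to "detail-like" (small).
variances = np.array([100.0, 1.0, 0.01])
times = np.array([emergence_time(v) for v in variances])

for v, t in zip(variances, times):
    print(f"variance {v:>6}: half-committed at t = {t:.3f}")
```

In this toy setting, coordinates with larger variance cross the threshold at larger t, that is, earlier in the reverse process, which matches the "outline first, details later" picture for a single mode. Whether the same ordering holds in a trained model is an empirical question that the paper addresses.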

