变分扩散模型 (Variational Diffusion Models) - 专知论文

会员服务 ·

0

变分 · 似然 · 密度估计 · 基准测试 · 噪声 ·

2023 年 4 月 14 日

Variational Diffusion Models

翻译：变分扩散模型

Diederik P. Kingma,Tim Salimans,Ben Poole,Jonathan Ho

from arxiv, Published at NeurIPS'21

Diffusion-based generative models have demonstrated a capacity for perceptually impressive synthesis, but can they also be great likelihood-based models? We answer this in the affirmative, and introduce a family of diffusion-based generative models that obtain state-of-the-art likelihoods on standard image density estimation benchmarks. Unlike other diffusion-based models, our method allows for efficient optimization of the noise schedule jointly with the rest of the model. We show that the variational lower bound (VLB) simplifies to a remarkably short expression in terms of the signal-to-noise ratio of the diffused data, thereby improving our theoretical understanding of this model class. Using this insight, we prove an equivalence between several models proposed in the literature. In addition, we show that the continuous-time VLB is invariant to the noise schedule, except for the signal-to-noise ratio at its endpoints. This enables us to learn a noise schedule that minimizes the variance of the resulting VLB estimator, leading to faster optimization. Combining these advances with architectural improvements, we obtain state-of-the-art likelihoods on image density estimation benchmarks, outperforming autoregressive models that have dominated these benchmarks for many years, with often significantly faster optimization. In addition, we show how to use the model as part of a bits-back compression scheme, and demonstrate lossless compression rates close to the theoretical optimum. Code is available at https://github.com/google-research/vdm .

翻译：基于扩散的生成模型展现出了极强的感知合成能力，但它们能否成为出色的似然模型呢？我们回答了这个问题，提出了一族基于扩散的生成模型，在标准图像密度估计基准测试中获得了最先进的似然值。与其他基于扩散的模型不同，我们的方法允许有效地联合优化噪声时间表和模型的其余部分。我们展示了变分下界（VLB）可以简化为扩散数据信噪比的一个非常短的表达式，从而提高了我们对这一模型类的理论理解。利用这一见解，我们证明了文献中提出的几种模型之间的等价性。此外，我们展示了连续时间下的VLB对于噪声时间表是不变的，除了其端点处的信噪比。这使我们能够学习一种噪声时间表，来最小化VLB估计器的方差，从而导致更快的优化。将这些进展与结构改进相结合，我们在图像密度估计基准测试中获得了最先进的似然值，优于占据这些基准测试多年的自回归模型，其优化速度往往更快。此外，我们展示了如何将模型作为Bits-back压缩方案的一部分使用，并演示了接近理论最优值的无损压缩速率。代码可在 https://github.com/google-research/vdm 上获得。

0

相关内容

【NeurIPS 2022】Stable Diffusion采样速度翻倍！清华提出扩散模型高效求解器

【NeurIPS 2022】Stable Diffusion采样速度翻倍！清华提出扩散模型高效求解器

专知会员服务

49+阅读 · 2022年11月17日

扩散模型综述又一弹！西湖大学李子青等最新《生成式扩散模型》综述，18页pdf详解扩散模型基础、方法体系和应用

扩散模型综述又一弹！西湖大学李子青等最新《生成式扩散模型》综述，18页pdf详解扩散模型基础、方法体系和应用

专知会员服务

119+阅读 · 2022年9月9日

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

专知会员服务

15+阅读 · 2022年3月12日

最新【深度生成模型】Deep Generative Models，104页ppt

最新【深度生成模型】Deep Generative Models，104页ppt

专知会员服务

71+阅读 · 2020年10月24日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

专知会员服务

37+阅读 · 2020年2月27日

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

专知会员服务

28+阅读 · 2020年2月18日

【斯坦福大学】领域自适应小样本生成（DAWSON: A Domain Adaptive Few Shot Generation Framework）

【斯坦福大学】领域自适应小样本生成（DAWSON: A Domain Adaptive Few Shot Generation Framework）

专知会员服务

36+阅读 · 2020年1月7日

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

专知会员服务

32+阅读 · 2019年12月26日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

生成扩散模型漫谈：统一扩散模型（应用篇）

生成扩散模型漫谈：统一扩散模型（应用篇）

PaperWeekly

0+阅读 · 2022年11月19日

Stable Diffusion采样速度翻倍！仅需10到25步的扩散模型采样算法

Stable Diffusion采样速度翻倍！仅需10到25步的扩散模型采样算法

机器之心

0+阅读 · 2022年11月14日

斯坦福/谷歌大脑：两次蒸馏，引导扩散模型采样提速256倍！

斯坦福/谷歌大脑：两次蒸馏，引导扩散模型采样提速256倍！

新智元

2+阅读 · 2022年10月20日

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

机器之心

2+阅读 · 2022年10月12日

采样提速256倍，蒸馏扩散模型生成图像质量媲美教师模型，只需4步

采样提速256倍，蒸馏扩散模型生成图像质量媲美教师模型，只需4步

机器之心

0+阅读 · 2022年10月11日

SIGIR2019 接收论文列表

SIGIR2019 接收论文列表

专知

18+阅读 · 2019年4月20日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

12+阅读 · 2018年6月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

基于压缩感知的信号重建快速算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

带跳扩散模型的非参数统计推断研究

国家自然科学基金

0+阅读 · 2013年12月31日

多元线性整值时间序列的统计分析

国家自然科学基金

2+阅读 · 2013年12月31日

基于Realized GARCH框架的波动率和相关性模型理论和应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

基于CS（压缩传感）理论的快速核磁共振成像技术研究

国家自然科学基金

1+阅读 · 2009年12月31日

基于阈值光电子－光离子符合成像的团簇光谱和解离研究

国家自然科学基金

0+阅读 · 2009年12月31日

压缩采样框架下的自适应稀疏信号感知与重建

国家自然科学基金

0+阅读 · 2009年12月31日

膜曝气生物膜稳定短程硝化反硝化的机理及调控

国家自然科学基金

0+阅读 · 2009年12月31日

Consistency Models

Arxiv

0+阅读 · 2023年5月31日

Likelihood-Based Diffusion Language Models

Arxiv

0+阅读 · 2023年5月30日

Are Diffusion Models Vulnerable to Membership Inference Attacks?

Arxiv

0+阅读 · 2023年5月30日

DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection

Arxiv

0+阅读 · 2023年5月28日

DiffusionNAG: Task-guided Neural Architecture Generation with Diffusion Models

Arxiv

0+阅读 · 2023年5月26日

Diffusion Models in Vision: A Survey

Arxiv

29+阅读 · 2022年9月10日

A Survey on Generative Diffusion Model

Arxiv

45+阅读 · 2022年9月6日

Diffusion Models: A Comprehensive Survey of Methods and Applications

Arxiv

67+阅读 · 2022年9月2日

Understanding Diffusion Models: A Unified Perspective

Arxiv

14+阅读 · 2022年8月25日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

VIP会员

文章信息

相关主题

相关VIP内容

【NeurIPS 2022】Stable Diffusion采样速度翻倍！清华提出扩散模型高效求解器

【NeurIPS 2022】Stable Diffusion采样速度翻倍！清华提出扩散模型高效求解器

专知会员服务

49+阅读 · 2022年11月17日

扩散模型综述又一弹！西湖大学李子青等最新《生成式扩散模型》综述，18页pdf详解扩散模型基础、方法体系和应用

扩散模型综述又一弹！西湖大学李子青等最新《生成式扩散模型》综述，18页pdf详解扩散模型基础、方法体系和应用

专知会员服务

119+阅读 · 2022年9月9日

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

【CVPR 2022】可转移的稀疏对抗性攻击，Transferable Sparse Adversarial Attack

专知会员服务

15+阅读 · 2022年3月12日

最新【深度生成模型】Deep Generative Models，104页ppt

最新【深度生成模型】Deep Generative Models，104页ppt

专知会员服务

71+阅读 · 2020年10月24日

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

生成性对抗网络:理论模型、评估指标和最近发展的概述，Generative Adversarial Networks (GANs): An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments

专知会员服务

42+阅读 · 2020年5月30日

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

【清华大学】诊断和增强VAE模型，Diagnosing and Enhancing VAE Models

专知会员服务

37+阅读 · 2020年2月27日

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

生成式对抗网络先验贝叶斯推断，Bayesian Inference with Generative Adversarial Network Priors

专知会员服务

28+阅读 · 2020年2月18日

【斯坦福大学】领域自适应小样本生成（DAWSON: A Domain Adaptive Few Shot Generation Framework）

【斯坦福大学】领域自适应小样本生成（DAWSON: A Domain Adaptive Few Shot Generation Framework）

专知会员服务

36+阅读 · 2020年1月7日

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

专知会员服务

32+阅读 · 2019年12月26日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】逆强化学习中的部分可识别性与模型设定错误

投大模型岗？50道大型语言模型（LLM）面试问题汇总

深度学习的多视角三维重建技术综述

【ICML2025】扩散模型中参数高效微调的零样本适应

相关资讯

生成扩散模型漫谈：统一扩散模型（应用篇）

生成扩散模型漫谈：统一扩散模型（应用篇）

PaperWeekly

0+阅读 · 2022年11月19日

Stable Diffusion采样速度翻倍！仅需10到25步的扩散模型采样算法

Stable Diffusion采样速度翻倍！仅需10到25步的扩散模型采样算法

机器之心

0+阅读 · 2022年11月14日

斯坦福/谷歌大脑：两次蒸馏，引导扩散模型采样提速256倍！

斯坦福/谷歌大脑：两次蒸馏，引导扩散模型采样提速256倍！

新智元

2+阅读 · 2022年10月20日

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

Soft Diffusion：谷歌新框架从通用扩散过程中正确调度、学习和采样

机器之心

2+阅读 · 2022年10月12日

采样提速256倍，蒸馏扩散模型生成图像质量媲美教师模型，只需4步

采样提速256倍，蒸馏扩散模型生成图像质量媲美教师模型，只需4步

机器之心

0+阅读 · 2022年10月11日

SIGIR2019 接收论文列表

SIGIR2019 接收论文列表

专知

18+阅读 · 2019年4月20日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

12+阅读 · 2018年6月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Consistency Models

Arxiv

0+阅读 · 2023年5月31日

Likelihood-Based Diffusion Language Models

Arxiv

0+阅读 · 2023年5月30日

Are Diffusion Models Vulnerable to Membership Inference Attacks?

Arxiv

0+阅读 · 2023年5月30日

DiffProtect: Generate Adversarial Examples with Diffusion Models for Facial Privacy Protection

Arxiv

0+阅读 · 2023年5月28日

DiffusionNAG: Task-guided Neural Architecture Generation with Diffusion Models

Arxiv

0+阅读 · 2023年5月26日

Diffusion Models in Vision: A Survey

Arxiv

29+阅读 · 2022年9月10日

A Survey on Generative Diffusion Model

Arxiv

45+阅读 · 2022年9月6日

Diffusion Models: A Comprehensive Survey of Methods and Applications

Arxiv

67+阅读 · 2022年9月2日

Understanding Diffusion Models: A Unified Perspective

Arxiv

14+阅读 · 2022年8月25日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

相关基金

基于压缩感知的信号重建快速算法研究

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

带跳扩散模型的非参数统计推断研究

国家自然科学基金

0+阅读 · 2013年12月31日

多元线性整值时间序列的统计分析

国家自然科学基金

2+阅读 · 2013年12月31日

基于Realized GARCH框架的波动率和相关性模型理论和应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

基于CS（压缩传感）理论的快速核磁共振成像技术研究

国家自然科学基金

1+阅读 · 2009年12月31日

基于阈值光电子－光离子符合成像的团簇光谱和解离研究

国家自然科学基金

0+阅读 · 2009年12月31日

压缩采样框架下的自适应稀疏信号感知与重建

国家自然科学基金

0+阅读 · 2009年12月31日

膜曝气生物膜稳定短程硝化反硝化的机理及调控

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员