Diffusion models in the literature are optimized with various objectives that are special cases of a weighted loss, where a weighting function specifies the weight per noise level. Uniform weighting corresponds to maximizing the ELBO, a principled approximation of maximum likelihood. In current practice, however, diffusion models are optimized with non-uniform weighting because it yields better sample quality. In this work we expose a direct relationship between the weighted loss (with any weighting) and the ELBO objective. We show that the weighted loss can be written as a weighted integral of ELBOs, one ELBO per noise level. If the weighting function is monotonic, then the weighted loss is a likelihood-based objective: it maximizes the ELBO under simple data augmentation, namely Gaussian noise perturbation. Our main contribution is a deeper theoretical understanding of the diffusion objective, but we also performed experiments comparing monotonic with non-monotonic weightings, finding that monotonic weighting performs competitively with the best published results.
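The weighted loss described above can be sketched in a few lines. The following is a minimal illustrative Monte Carlo estimate, not the paper's exact objective: the function names, the log-SNR parameterization `lam`, and the example sigmoid weighting are assumptions, and factors such as the noise-schedule density are omitted for brevity. A constant weighting function plays the role of the uniform (ELBO) case.

```python
import numpy as np

def weighted_diffusion_loss(eps_pred, eps, lam, w):
    """Illustrative Monte Carlo estimate of a weighted denoising loss.

    eps_pred : model's noise prediction, shape (batch, dim)  [hypothetical]
    eps      : the Gaussian noise that was added, shape (batch, dim)
    lam      : per-example log signal-to-noise ratio, shape (batch,)
    w        : weighting function over noise levels; a constant w
               corresponds to the uniform (ELBO-maximizing) case
    """
    per_example = np.sum((eps_pred - eps) ** 2, axis=-1)  # squared error per sample
    return np.mean(w(lam) * per_example)                  # weight by noise level, average

# Toy usage with random stand-ins for model predictions
rng = np.random.default_rng(0)
eps = rng.standard_normal((4, 8))
eps_pred = eps + 0.1 * rng.standard_normal((4, 8))
lam = rng.uniform(-10.0, 10.0, size=4)

uniform = weighted_diffusion_loss(eps_pred, eps, lam, lambda l: np.ones_like(l))
# A monotonically decreasing example weighting, sigmoid(-lam):
monotonic = weighted_diffusion_loss(eps_pred, eps, lam, lambda l: 1.0 / (1.0 + np.exp(l)))
```

Because the example weighting sigmoid(-lam) lies strictly between 0 and 1, the monotonic estimate here is always smaller than the uniform one; only the relative weighting across noise levels, not the overall scale, distinguishes the objectives.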