原原小碎碎石梯度 Langevin 算法双元解释 (Primal Dual Interpretation of the Proximal Stochastic Gradient Langevin Algorithm) - 专知论文

会员服务 ·

0

强对偶性 · 对偶算法 · 泛化理论 · CASES · contrastive ·

2021 年 2 月 22 日

Primal Dual Interpretation of the Proximal Stochastic Gradient Langevin Algorithm

翻译：原原小碎碎石梯度 Langevin 算法双元解释

Adil Salim,Peter Richtárik

We consider the task of sampling with respect to a log concave probability distribution. The potential of the target distribution is assumed to be composite, \textit{i.e.}, written as the sum of a smooth convex term, and a nonsmooth convex term possibly taking infinite values. The target distribution can be seen as a minimizer of the Kullback-Leibler divergence defined on the Wasserstein space (\textit{i.e.}, the space of probability measures). In the first part of this paper, we establish a strong duality result for this minimization problem. In the second part of this paper, we use the duality gap arising from the first part to study the complexity of the Proximal Stochastic Gradient Langevin Algorithm (PSGLA), which can be seen as a generalization of the Projected Langevin Algorithm. Our approach relies on viewing PSGLA as a primal dual algorithm and covers many cases where the target distribution is not fully supported. In particular, we show that if the potential is strongly convex, the complexity of PSGLA is $O(1/\varepsilon^2)$ in terms of the 2-Wasserstein distance. In contrast, the complexity of the Projected Langevin Algorithm is $O(1/\varepsilon^{12})$ in terms of total variation when the potential is convex.

翻译：我们考虑的是关于对正弦概率分布的取样任务。在本文的第一部分, 我们假设目标分布的可能性是复合的,\ textit{ i. e.} 。以平滑的 convex 术语的总和写成, 而非moots convex 术语可能包含无限值。目标分布可以被视为瓦塞斯坦空间(\ textit{ i. e.}) 定义的 Kullback- Leiber12 差异的最小化。我们的方法取决于将 PSGLA 视为一种原始的双重算法, 并涵盖目标分布未得到充分支持的许多情况。在本文的第二部分, 我们使用第一部分产生的双性差距来研究Proximal Stophatistic Grainatic Grangevient Langeevin Algorithm (PSGLA) 的复杂性。在 Vaxional AL LA LA LA 的复杂性中, 我们展示的是, 当目标分布不完全支持时, $_ x( 美元) 美元的变异性为 AL 。

0

相关内容

强对偶性

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

57+阅读 · 2020年11月21日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

策略梯度方法的算子视图，An operator view of policy gradient methods

策略梯度方法的算子视图，An operator view of policy gradient methods

专知会员服务

11+阅读 · 2020年6月23日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

专知会员服务

49+阅读 · 2020年1月1日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

神经网络学习率设置

神经网络学习率设置

机器学习研究会

4+阅读 · 2018年3月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Local Dvoretzky-Kiefer-Wolfowitz confidence bands

Arxiv

0+阅读 · 2021年4月14日

Weighted error estimates for transient transport problems discretized using continuous finite elements with interior penalty stabilization on the gradient jumps

Arxiv

0+阅读 · 2021年4月14日

Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework

Arxiv

0+阅读 · 2021年4月13日

An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General Constraints

Arxiv

0+阅读 · 2021年4月13日

Convergence Properties of Stochastic Hypergradients

Arxiv

0+阅读 · 2021年4月12日

Algorithms and Complexity for the Almost Equal Maximum Flow Problem

Arxiv

0+阅读 · 2021年4月12日

A rank-adaptive robust integrator for dynamical low-rank approximation

Arxiv

0+阅读 · 2021年4月12日

Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent

Arxiv

0+阅读 · 2021年4月12日

A Dual-Mixed Approximation for a Huber Regularization of the Herschel-Bulkey Flow Problem

Arxiv

0+阅读 · 2021年4月9日

Displacement-Driven Approach to Nonlocal Elasticity

Arxiv

0+阅读 · 2021年4月8日

VIP会员

文章信息

相关主题

相关VIP内容

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

57+阅读 · 2020年11月21日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

策略梯度方法的算子视图，An operator view of policy gradient methods

策略梯度方法的算子视图，An operator view of policy gradient methods

专知会员服务

11+阅读 · 2020年6月23日

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

【基于模型的强化学习的博弈论框架】A Game Theoretic Framework for Model Based Reinforcement Learning

专知会员服务

131+阅读 · 2020年4月19日

因果图，Causal Graphs，52页ppt

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

最大均方差正则化贝叶斯神经网络，Bayesian Neural Networks With Maximum Mean Discrepancy Regularization

专知会员服务

54+阅读 · 2020年3月5日

【MIT】时间序列GAN，Subadditivity of Probability Divergences

专知会员服务

63+阅读 · 2020年3月4日

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

专知会员服务

49+阅读 · 2020年1月1日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICML2025】通过双重平衡协同专家解决不平衡的领域增量学习问题

用于语言生成的离散扩散模型

中文版 | 融合革命：无人机与人工智能如何驱动未来战争

AI应用正当时，详解AI应用开发新范式

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

神经网络学习率设置

神经网络学习率设置

机器学习研究会

4+阅读 · 2018年3月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Local Dvoretzky-Kiefer-Wolfowitz confidence bands

Arxiv

0+阅读 · 2021年4月14日

Weighted error estimates for transient transport problems discretized using continuous finite elements with interior penalty stabilization on the gradient jumps

Arxiv

0+阅读 · 2021年4月14日

Computation for Latent Variable Model Estimation: A Unified Stochastic Proximal Framework

Arxiv

0+阅读 · 2021年4月13日

An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General Constraints

Arxiv

0+阅读 · 2021年4月13日

Convergence Properties of Stochastic Hypergradients

Arxiv

0+阅读 · 2021年4月12日

Algorithms and Complexity for the Almost Equal Maximum Flow Problem

Arxiv

0+阅读 · 2021年4月12日

A rank-adaptive robust integrator for dynamical low-rank approximation

Arxiv

0+阅读 · 2021年4月12日

Homeomorphic-Invariance of EM: Non-Asymptotic Convergence in KL Divergence for Exponential Families via Mirror Descent

Arxiv

0+阅读 · 2021年4月12日

A Dual-Mixed Approximation for a Huber Regularization of the Herschel-Bulkey Flow Problem

Arxiv

0+阅读 · 2021年4月9日

Displacement-Driven Approach to Nonlocal Elasticity

Arxiv

0+阅读 · 2021年4月8日

微信扫码咨询专知VIP会员