Langevin Monte Carlo: random coordinate descent and variance reduction

Tags: variance reduction · reducible · coordinate descent · Monte Carlo · variance

September 13, 2021

Zhiyan Ding, Qin Li

From arXiv. arXiv admin note: text overlap with arXiv:2006.06068

Langevin Monte Carlo (LMC) is a popular Bayesian sampling method. For log-concave target distributions, the method converges exponentially fast, up to a controllable discretization error. However, it requires evaluating a full gradient in each iteration, which for a problem on $\mathbb{R}^d$ amounts to $d$ partial derivative evaluations per iteration. The cost is high when $d\gg1$. In this paper, we investigate how to improve computational efficiency by applying RCD (random coordinate descent) to LMC. The theory has two sides: (1) Blindly applying RCD to LMC replaces the full gradient with a randomly selected directional derivative in each iteration. Although the cost per iteration is reduced, the total number of iterations needed to reach a preset error tolerance increases, so ultimately there is no computational gain. (2) We then incorporate variance reduction techniques, such as SAGA (stochastic average gradient) and SVRG (stochastic variance reduced gradient), into RCD-LMC. We prove that the cost is reduced compared with classical LMC and that, in the underdamped case, convergence is achieved with the same number of iterations while each iteration requires only one directional derivative. This gives the best possible computational cost in the underdamped-LMC framework.
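To make the two sides concrete, here is a minimal sketch of the overdamped case, not the authors' implementation. It assumes a standard Gaussian target, $f(x) = \tfrac12\|x\|^2$, and uses one common unbiased surrogate for the gradient: $d$ times a single randomly chosen partial derivative. The SAGA-style variant keeps a table of previously evaluated partial derivatives and corrects only the sampled entry. All function names, the target, and the step size are illustrative assumptions; the paper's exact algorithms and step-size conditions may differ.

```python
import numpy as np

def partial_f(x, i):
    """i-th partial derivative of the assumed potential f(x) = 0.5 * ||x||^2."""
    return x[i]

def rcd_estimate(x, r):
    """Plain RCD surrogate: d * (partial_r f) along e_r, zero elsewhere."""
    d = x.size
    g = np.zeros(d)
    g[r] = d * partial_f(x, r)
    return g

def saga_estimate(x, r, table):
    """SAGA-style surrogate: stored partials, corrected in the sampled coordinate."""
    d = x.size
    g = table.copy()
    g[r] += d * (partial_f(x, r) - table[r])
    return g

def rcd_lmc(x0, h, n_iter, rng):
    """Overdamped LMC using one partial derivative per iteration (no variance reduction)."""
    x = np.array(x0, dtype=float)
    d = x.size
    for _ in range(n_iter):
        r = rng.integers(d)  # coordinate drawn uniformly from {0, ..., d-1}
        x = x - h * rcd_estimate(x, r) + np.sqrt(2.0 * h) * rng.standard_normal(d)
    return x

def saga_rcd_lmc(x0, h, n_iter, rng):
    """Overdamped LMC with the SAGA-style surrogate; still one new partial per iteration."""
    x = np.array(x0, dtype=float)
    d = x.size
    # One full gradient evaluation up front to initialise the table.
    table = np.array([partial_f(x, i) for i in range(d)])
    for _ in range(n_iter):
        r = rng.integers(d)
        g = saga_estimate(x, r, table)
        table[r] = partial_f(x, r)  # refresh only the sampled entry
        x = x - h * g + np.sqrt(2.0 * h) * rng.standard_normal(d)
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    print(rcd_lmc(np.ones(5), h=0.01, n_iter=2000, rng=rng))
    print(saga_rcd_lmc(np.ones(5), h=0.01, n_iter=2000, rng=rng))
```

Both surrogates are unbiased when averaged over the random coordinate. The difference is in the variance: the plain surrogate's variance scales with the gradient itself, while the SAGA-style correction scales only with how much the partial derivatives have changed since they were last stored, which is the intuition behind the reduced cost described in point (2).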

