快速复制交换器存储梯度 Langevin 动态 (Fast Replica Exchange Stochastic Gradient Langevin Dynamics) - 专知论文

会员服务 ·

0

可交换的 · 估计/估计量 · 回火 · FAST · 计算成本 ·

2023 年 1 月 5 日

Fast Replica Exchange Stochastic Gradient Langevin Dynamics

翻译：快速复制交换器存储梯度 Langevin 动态

Guanxun Li,Guang Lin,Zecheng Zhang,Quan Zhou

Application of the replica exchange (i.e., parallel tempering) technique to Langevin Monte Carlo algorithms, especially stochastic gradient Langevin dynamics (SGLD), has scored great success in non-convex learning problems, but one potential limitation is the computational cost caused by running multiple chains. Upon observing that a large variance of the gradient estimator in SGLD essentially increases the temperature of the stationary distribution, we propose expediting tempering schemes for SGLD by directly estimating the bias caused by the stochastic gradient estimator. This simple idea enables us to simulate high-temperature chains at a negligible computational cost (compared to that of the low-temperature chain) while preserving the convergence to the target distribution. Our method is fundamentally different from the recently proposed m-reSGLD (multi-variance replica exchange SGLD) method in that the latter suffers from the low accuracy of the gradient estimator (e.g., the chain can fail to converge to the target) while our method benefits from it. Further, we derive a swapping rate that can be easily evaluated, providing another significant improvement over m-reSGLD. To theoretically demonstrate the advantage of our method, we develop convergence bounds in Wasserstein distances. Numerical examples for Gaussian mixture and inverse PDE models are also provided, which show that our method can converge quicker than the vanilla multi-variance replica exchange method.

翻译：对Langevin Monte Carlo算法应用复制交换(即平行调温)技术,特别是随机梯度梯度Langevin Langevin动态(SGLD)在非康纳学习问题中取得了巨大成功,但一个潜在的限制是运行多个链的计算成本。在发现SGLD的梯度估计器差异很大,基本上提高了定点分布的温度之后,我们建议直接估计振动梯度估计器造成的偏差,从而加快SGLD的调制办法。这一简单的想法使我们能够模拟高温链,而计算成本微不足道(与低温链相比),同时保持与目标分布的趋同。我们的方法与最近提议的 m-reSGLD(多变换汇SGLD)方法有根本的不同,后者由于梯度估计梯度估计器的精度低(例如,链度无法与目标相趋同)而加快调。此外,我们在方法上也得出一种汇率转换率,从而能够轻松地展示我们更趋近的方法。

0

相关内容

可交换的

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

混凝土耐久性损伤光纤光栅声发射传感机理与应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

LincRNA在CYP46A1基因第二内含子T/C多态性对AD易感性影响中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

混凝土Weibull统计尺寸效应理论模型改进研究

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩湍流粒子输运的拉格朗日（Lagrangian）研究

国家自然科学基金

0+阅读 · 2013年12月31日

具有完整与非完整约束刚柔耦合非光滑多体系统动力学数值算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

刚性和柔性配体协同调控的钯(II)、铂(II)配合物的分子设计、结构及细胞毒性

国家自然科学基金

0+阅读 · 2011年12月31日

典型非线性浅水波方程的低正则解散射及渐进性研究

国家自然科学基金

0+阅读 · 2011年12月31日

复合污染条件下DOM对典型离子性抗生素吸附迁移行为的影响

国家自然科学基金

0+阅读 · 2008年12月31日

Implicit Stochastic Gradient Descent for Training Physics-informed Neural Networks

Arxiv

0+阅读 · 2023年3月3日

Evolutionary Multi-Objective Algorithms for the Knapsack Problems with Stochastic Profits

Arxiv

0+阅读 · 2023年3月3日

Training Efficient Controllers via Analytic Policy Gradient

Arxiv

0+阅读 · 2023年3月2日

Neuroevolution Surpasses Stochastic Gradient Descent for Physics-Informed Neural Networks

Arxiv

0+阅读 · 2023年3月2日

Quantifying the mini-batching error in Bayesian inference for Adaptive Langevin dynamics

Arxiv

0+阅读 · 2023年3月1日

Conditional Poisson Stochastic Beam Search

Arxiv

0+阅读 · 2023年3月1日

Re-weighting Based Group Fairness Regularization via Classwise Robust Optimization

Arxiv

0+阅读 · 2023年3月1日

Parameter estimation for the stochastic heat equation with multiplicative noise from local measurements

Arxiv

0+阅读 · 2023年2月28日

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

Arxiv

15+阅读 · 2021年9月6日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《从装备到文化：美陆军技术素养建设启示录》最新报告

人工智能安全治理白皮书（2025）

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

《商用大语言模型的升级风险管理：国家安全运用》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Implicit Stochastic Gradient Descent for Training Physics-informed Neural Networks

Arxiv

0+阅读 · 2023年3月3日

Evolutionary Multi-Objective Algorithms for the Knapsack Problems with Stochastic Profits

Arxiv

0+阅读 · 2023年3月3日

Training Efficient Controllers via Analytic Policy Gradient

Arxiv

0+阅读 · 2023年3月2日

Neuroevolution Surpasses Stochastic Gradient Descent for Physics-Informed Neural Networks

Arxiv

0+阅读 · 2023年3月2日

Quantifying the mini-batching error in Bayesian inference for Adaptive Langevin dynamics

Arxiv

0+阅读 · 2023年3月1日

Conditional Poisson Stochastic Beam Search

Arxiv

0+阅读 · 2023年3月1日

Re-weighting Based Group Fairness Regularization via Classwise Robust Optimization

Arxiv

0+阅读 · 2023年3月1日

Parameter estimation for the stochastic heat equation with multiplicative noise from local measurements

Arxiv

0+阅读 · 2023年2月28日

A Farewell to the Bias-Variance Tradeoff? An Overview of the Theory of Overparameterized Machine Learning

Arxiv

15+阅读 · 2021年9月6日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

相关基金

两类带导数的非线性Schrodinger方程拟周期解的存在性

国家自然科学基金

0+阅读 · 2015年12月31日

混凝土耐久性损伤光纤光栅声发射传感机理与应用研究

国家自然科学基金

0+阅读 · 2015年12月31日

LincRNA在CYP46A1基因第二内含子T/C多态性对AD易感性影响中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

混凝土Weibull统计尺寸效应理论模型改进研究

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩湍流粒子输运的拉格朗日（Lagrangian）研究

国家自然科学基金

0+阅读 · 2013年12月31日

具有完整与非完整约束刚柔耦合非光滑多体系统动力学数值算法研究

国家自然科学基金

0+阅读 · 2012年12月31日

刚性和柔性配体协同调控的钯(II)、铂(II)配合物的分子设计、结构及细胞毒性

国家自然科学基金

0+阅读 · 2011年12月31日

典型非线性浅水波方程的低正则解散射及渐进性研究

国家自然科学基金

0+阅读 · 2011年12月31日

复合污染条件下DOM对典型离子性抗生素吸附迁移行为的影响

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员