The rise of artificial intelligence (AI) hinges on the efficient training of modern deep neural networks (DNNs), which requires both non-convex optimization and uncertainty quantification and can be cast as a non-convex Bayesian learning problem. A standard tool for this problem is Langevin Monte Carlo, which approximates the posterior distribution with theoretical guarantees. In this thesis, we start with replica exchange Langevin Monte Carlo (also known as parallel tempering), which accelerates convergence through appropriate swaps between exploration and exploitation chains. However, the na\"ive extension of swaps to big data problems incurs a large bias, so bias-corrected swaps are required; this correction, in turn, yields few effective swaps and insignificant acceleration. To alleviate this issue, we first propose a control-variates method that reduces the variance of the noisy energy estimators and show its potential to accelerate the exponential convergence. We also present a population-chain replica exchange scheme based on non-reversibility and obtain an optimal round-trip rate for deep learning.

In the second part of the thesis, we study scalable dynamic importance sampling algorithms based on stochastic approximation. Traditional dynamic importance sampling algorithms have achieved success, but their lack of scalability has greatly limited their extension to big data. To address this issue, we resolve the vanishing-gradient problem and propose two scalable dynamic importance sampling algorithms. Theoretically, we establish the stability condition for the underlying ordinary differential equation (ODE) system and guarantee the asymptotic convergence of the latent variable to the desired fixed point. Interestingly, this result continues to hold for non-convex energy landscapes.
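To make the swap mechanism concrete, below is a minimal sketch of two-chain replica exchange stochastic gradient Langevin dynamics on a toy double-well energy. The Gaussian energy noise with known variance, and the exponential-martingale correction (subtracting c * var/2 from the noisy energy difference, with c the inverse-temperature gap, so the exponentiated estimator is unbiased for the true swap rate) are illustrative assumptions; the exact estimator and correction used in the thesis may differ.

```python
import numpy as np

def grad_U(theta, rng, noise_std=0.0):
    # Gradient of the toy double-well energy U(theta) = 0.25*theta^4 - 0.5*theta^2;
    # optional Gaussian noise mimics a stochastic (mini-batch) gradient.
    return theta**3 - theta + noise_std * rng.standard_normal()

def U_hat(theta, rng, noise_std=0.0):
    # Noisy energy estimator, as produced by a mini-batch in big-data settings.
    return 0.25 * theta**4 - 0.5 * theta**2 + noise_std * rng.standard_normal()

def resgld(n_steps=10_000, lr=1e-3, tau_lo=0.1, tau_hi=1.0,
           energy_noise_std=0.5, seed=0):
    """Two-chain replica exchange SGLD with a bias-corrected swap test (sketch)."""
    rng = np.random.default_rng(seed)
    th_lo, th_hi = 1.0, -1.0            # exploitation / exploration chains
    c = 1.0 / tau_lo - 1.0 / tau_hi     # inverse-temperature gap (> 0)
    var_sum = 2 * energy_noise_std**2   # variance of the energy-difference estimator
    samples, swaps = [], 0
    for _ in range(n_steps):
        # Langevin updates at each temperature.
        th_lo += -lr * grad_U(th_lo, rng) + np.sqrt(2 * lr * tau_lo) * rng.standard_normal()
        th_hi += -lr * grad_U(th_hi, rng) + np.sqrt(2 * lr * tau_hi) * rng.standard_normal()
        # Bias-corrected swap: subtract c * var_sum / 2 so the exponentiated noisy
        # energy difference is unbiased for the true swap rate.
        dU = U_hat(th_lo, rng, energy_noise_std) - U_hat(th_hi, rng, energy_noise_std)
        log_s = c * (dU - c * var_sum / 2.0)
        if np.log(rng.uniform()) < min(0.0, log_s):
            th_lo, th_hi = th_hi, th_lo
            swaps += 1
        samples.append(th_lo)
    return np.array(samples), swaps
```

Note how the correction term grows with the estimator variance: the noisier the mini-batch energies, the smaller the effective swap rate, which is precisely why variance reduction matters.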
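The control-variates idea can likewise be sketched: evaluate the per-sample energies at both the current point and a fixed reference point on the same mini-batch, so the mini-batch noise largely cancels. The quadratic per-sample energy, the reference point, and the batch size below are hypothetical choices for illustration only.

```python
import numpy as np

def energy_cv(theta, theta_ref, U_full_ref, per_point_u, data, batch_idx):
    """Control-variate energy estimator:
    U_cv(theta) = U(theta_ref) + (N/n) * sum_{i in batch} [u_i(theta) - u_i(theta_ref)].
    The same mini-batch evaluates both terms, so their noise largely cancels
    when theta is close to theta_ref."""
    N, n = len(data), len(batch_idx)
    batch = data[batch_idx]
    correction = (N / n) * np.sum(per_point_u(theta, batch) - per_point_u(theta_ref, batch))
    return U_full_ref + correction

rng = np.random.default_rng(1)
data = rng.standard_normal(10_000) + 2.0
per_point_u = lambda th, x: 0.5 * (th - x) ** 2   # toy Gaussian-mean energy
theta_ref = data.mean()                           # reference, e.g. from a warm-up optimizer
U_full_ref = per_point_u(theta_ref, data).sum()   # full-data energy, computed once

theta = theta_ref + 0.05
plain, cv = [], []
for _ in range(2_000):
    idx = rng.choice(len(data), size=128, replace=False)
    plain.append(len(data) / 128 * per_point_u(theta, data[idx]).sum())
    cv.append(energy_cv(theta, theta_ref, U_full_ref, per_point_u, data, idx))
print(np.var(plain), np.var(cv))  # CV variance is typically orders of magnitude smaller
```

On this toy problem the control-variate estimator's variance shrinks dramatically whenever theta stays near theta_ref, which is exactly the regime where more swaps survive the bias correction.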
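For the second part, below is a rough sketch of a contour-style dynamic importance sampler built on stochastic approximation. The energy-bin partition, the gradient multiplier, the step-size schedule, and the exponent zeta are all assumed hyperparameters for illustration; the bin-weight vector theta stands in for the latent variable whose convergence to the desired fixed point the thesis analyzes.

```python
import numpy as np

def dis_sketch(grad_U, U, x0, n_steps=50_000, lr=1e-3, tau=1.0,
               u_min=-2.0, u_max=6.0, m=50, zeta=0.75, seed=0):
    """Minimal sketch of a dynamic importance sampler: a Langevin chain whose
    drift is reweighted by a self-adapting latent vector theta over energy
    bins, with theta driven to its fixed point by stochastic approximation."""
    rng = np.random.default_rng(seed)
    x = x0
    du = (u_max - u_min) / m
    theta = np.full(m, 1.0 / m)          # latent bin weights on the simplex
    for k in range(1, n_steps + 1):
        j = int(np.clip((U(x) - u_min) / du, 1, m - 1))  # current energy bin
        # Reweighted drift: the log-ratio of neighboring bin weights rescales
        # the gradient, which keeps it from vanishing in flat or rare regions.
        mult = 1.0 + zeta * tau * (np.log(theta[j]) - np.log(theta[j - 1])) / du
        x = x - lr * mult * grad_U(x) + np.sqrt(2 * lr * tau) * rng.standard_normal()
        # Stochastic approximation: decaying steps omega_k drive theta toward
        # the fixed point where bins are visited at the designed frequencies.
        omega = 10.0 / (k**0.8 + 100.0)
        onehot = np.zeros(m)
        onehot[j] = 1.0
        theta += omega * theta[j]**zeta * (onehot - theta)
        theta = np.clip(theta, 1e-10, None)
        theta /= theta.sum()
    return x, theta

# Example on the same double-well energy as above:
x_end, theta_end = dis_sketch(lambda x: x**3 - x,
                              lambda x: 0.25 * x**4 - 0.5 * x**2, x0=0.0)
```

The decaying step sizes are what place this in the stochastic-approximation framework: the theta updates trace an underlying ODE, and the stability condition of that ODE is what guarantees convergence of the latent variable.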