Recently, a large amount of work has been devoted to the study of Markov chain stochastic gradient methods (MC-SGMs), mainly focusing on their convergence analysis for solving minimization problems. In this paper, we provide a comprehensive generalization analysis of MC-SGMs for both minimization and minimax problems through the lens of algorithmic stability in the framework of statistical learning theory. For empirical risk minimization (ERM) problems, we establish optimal excess population risk bounds for both smooth and non-smooth cases by introducing on-average argument stability. For minimax problems, we develop a quantitative connection between on-average argument stability and generalization error, which extends the existing results for uniform stability \cite{lei2021stability}. We further derive the first nearly optimal convergence rates for convex-concave problems, both in expectation and with high probability, which, combined with our stability results, show that optimal generalization bounds can be attained for both smooth and non-smooth cases. To the best of our knowledge, this is the first generalization analysis of SGMs when the gradients are sampled from a Markov process.
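As a toy illustration (not taken from the paper), the defining feature of an MC-SGM can be sketched as an ordinary SGD loop in which the index of the sampled gradient evolves along a Markov chain instead of being drawn i.i.d. The function name `mc_sgm`, the lazy random walk over the sample indices, and all parameter values below are our own illustrative choices for a one-dimensional least-squares objective.

```python
import random

# Illustrative sketch of an MC-SGM update for empirical risk
# minimization: the sampled index follows a Markov chain (here a
# lazy random walk on {0, ..., n-1}), not an i.i.d. draw.

def mc_sgm(data, steps=5000, eta=0.02, seed=0):
    """Minimize the empirical risk (1/n) * sum_i (w - data[i])^2 / 2
    with gradients sampled along a Markov chain over the indices."""
    rng = random.Random(seed)
    n = len(data)
    w = 0.0
    i = rng.randrange(n)          # initial state of the chain
    for _ in range(steps):
        grad = w - data[i]        # gradient of (w - data[i])^2 / 2
        w -= eta * grad
        # Markov transition: stay put, or move to a neighbour (mod n);
        # the stationary distribution is uniform over the indices.
        i = (i + rng.choice([-1, 0, 1])) % n
    return w

data = [1.0, 2.0, 3.0, 4.0]
w_hat = mc_sgm(data)
# Once the chain has mixed, w_hat should hover near the empirical
# risk minimizer, i.e. the mean of the data.
```

Because the chain's stationary distribution is uniform, the gradient is unbiased only asymptotically; the correlation between consecutive samples is exactly what makes the convergence and stability analysis of MC-SGMs harder than in the i.i.d. setting.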