We study the performance of Stochastic Cubic Regularized Newton (SCRN) on a class of functions satisfying the gradient dominance property, which holds in a wide range of applications in machine learning and signal processing. This condition ensures that any first-order stationary point is a global optimum. We prove that SCRN improves the best-known sample complexity of stochastic gradient descent for reaching an $\epsilon$-global optimum by a factor of $\mathcal{O}(\epsilon^{-1/2})$. Even under a weak version of the gradient dominance property, which is applicable to policy-based reinforcement learning (RL), SCRN achieves the same improvement over stochastic policy gradient methods. Additionally, we show that the sample complexity of SCRN can be further improved by a factor of $\mathcal{O}(\epsilon^{-1/2})$ using a variance reduction method with time-varying batch sizes. Experimental results in various RL settings showcase the remarkable performance of SCRN compared to first-order methods.
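For context, a minimal sketch of the gradient dominance condition referred to above, written in a standard parametrization with exponent $\alpha$ and constant $\tau_F$ (the paper's exact constants and notation may differ):
\[
F(\mathbf{x}) - \min_{\mathbf{x}'} F(\mathbf{x}') \;\le\; \tau_F \,\|\nabla F(\mathbf{x})\|^{\alpha}, \qquad \alpha \in [1,2].
\]
The case $\alpha = 2$ recovers the Polyak--{\L}ojasiewicz condition; under any $\alpha \in [1,2]$, a point with $\nabla F(\mathbf{x}) = 0$ has zero optimality gap, which is why every first-order stationary point is a global optimum.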