查找本地迷微米的更快、受扰动的碎石渐变方法 (Faster Perturbed Stochastic Gradient Methods for Finding Local Minima) - 专知论文

会员服务 ·

0

局部极小 · 极小值 · 鞍点 · 极小点 · 非凸 ·

2022 年 4 月 20 日

Faster Perturbed Stochastic Gradient Methods for Finding Local Minima

翻译：查找本地迷微米的更快、受扰动的碎石渐变方法

Zixiang Chen,Dongruo Zhou,Quanquan Gu

from arxiv, 29 pages, 1 figure, 1 table. In ALT 2022

Escaping from saddle points and finding local minimum is a central problem in nonconvex optimization. Perturbed gradient methods are perhaps the simplest approach for this problem. However, to find $(\epsilon, \sqrt{\epsilon})$-approximate local minima, the existing best stochastic gradient complexity for this type of algorithms is $\tilde O(\epsilon^{-3.5})$, which is not optimal. In this paper, we propose LENA (Last stEp shriNkAge), a faster perturbed stochastic gradient framework for finding local minima. We show that LENA with stochastic gradient estimators such as SARAH/SPIDER and STORM can find $(\epsilon, \epsilon_{H})$-approximate local minima within $\tilde O(\epsilon^{-3} + \epsilon_{H}^{-6})$ stochastic gradient evaluations (or $\tilde O(\epsilon^{-3})$ when $\epsilon_H = \sqrt{\epsilon}$). The core idea of our framework is a step-size shrinkage scheme to control the average movement of the iterates, which leads to faster convergence to the local minima.

翻译：从马鞍点跳出并找到本地最小值是非convex 优化的一个中心问题。不稳定梯度方法也许是这一问题的最简单的方法。但是, 要找到$( epsilon, \ sqrt ~ epsilon} ) 近似本地迷你, 现有的这种算法的最佳随机梯度复杂性是$( etilde O (\ epsilon) =- 3.5} 美元, 这并不是最理想的。在本文中, 我们提议使用 leNA ( last stEp shryNkAge), 快速的过敏梯度梯度框架( 或 $\ tilde O (\ epsilon) - 3} 来寻找本地迷你度梯度估计器, 例如 SASAH/ IDER 和 StorM 等, 现有最先进的梯度梯度梯度梯度梯度梯度梯度梯度梯度精度精度梯在 $ (\ silon, legn) rocal rodude sultlate_ slation_ a colver leglection) roflate_ slation_ sluplancelate rocal_ legal_ legy legal_ legal_ legaltaltaltal_ lection_ legal_ legal_ legal_ lection_ legal_ legaltalt_ lection_ lection_ lection_ legy_ legal_ legal_ legy_ le.

0

相关内容

局部极小

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

重离子储存环CSRe上激光冷却相对论能量类锂12C3+离子束的实验研究

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

少自由度空间耦合机构构型综合理论与方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

酸雨环境下既有拱桥吊杆抗力衰减规律与寿命预测方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

颗粒增强复合材料结构损伤演化和破坏的FEM-VCFEM-MD多尺度模拟

国家自然科学基金

0+阅读 · 2012年12月31日

基于粘滑原理的球形检测机器人原地转向运动时变滑模控制方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于HHT的超光谱图像高精度分类算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

ICF中高能电子和离子输运的Monte-Carlo算法研究和程序研制

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

Refined Convergence and Topology Learning for Decentralized Optimization with Heterogeneous Data

Arxiv

0+阅读 · 2022年6月10日

Stochastic Zeroth order Descent with Structured Directions

Arxiv

0+阅读 · 2022年6月10日

$p$-Sparsified Sketches for Fast Multiple Output Kernel Methods

Arxiv

0+阅读 · 2022年6月10日

On Gradient Descent Convergence beyond the Edge of Stability

Arxiv

0+阅读 · 2022年6月8日

A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima

Arxiv

0+阅读 · 2022年6月8日

A Unified Convergence Theorem for Stochastic Optimization Methods

Arxiv

0+阅读 · 2022年6月8日

Benign Underfitting of Stochastic Gradient Descent

Arxiv

0+阅读 · 2022年6月7日

Algorithmic Stability of Heavy-Tailed Stochastic Gradient Descent on Least Squares

Arxiv

0+阅读 · 2022年6月6日

Why Do Local Methods Solve Nonconvex Problems?

Arxiv

12+阅读 · 2021年3月24日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《复杂工程系统模型驱动设计决策支持系统：早期设计阶段挑战》最新138页

《日本陆上自卫队2040年作战方式与未来作战研究》最新23页slides

人工智能作为战争武器

《后勤保障》最新23页

相关资讯

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Refined Convergence and Topology Learning for Decentralized Optimization with Heterogeneous Data

Arxiv

0+阅读 · 2022年6月10日

Stochastic Zeroth order Descent with Structured Directions

Arxiv

0+阅读 · 2022年6月10日

$p$-Sparsified Sketches for Fast Multiple Output Kernel Methods

Arxiv

0+阅读 · 2022年6月10日

On Gradient Descent Convergence beyond the Edge of Stability

Arxiv

0+阅读 · 2022年6月8日

A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima

Arxiv

0+阅读 · 2022年6月8日

A Unified Convergence Theorem for Stochastic Optimization Methods

Arxiv

0+阅读 · 2022年6月8日

Benign Underfitting of Stochastic Gradient Descent

Arxiv

0+阅读 · 2022年6月7日

Algorithmic Stability of Heavy-Tailed Stochastic Gradient Descent on Least Squares

Arxiv

0+阅读 · 2022年6月6日

Why Do Local Methods Solve Nonconvex Problems?

Arxiv

12+阅读 · 2021年3月24日

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks

Arxiv

13+阅读 · 2020年6月24日

相关基金

重离子储存环CSRe上激光冷却相对论能量类锂12C3+离子束的实验研究

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

少自由度空间耦合机构构型综合理论与方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

酸雨环境下既有拱桥吊杆抗力衰减规律与寿命预测方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

颗粒增强复合材料结构损伤演化和破坏的FEM-VCFEM-MD多尺度模拟

国家自然科学基金

0+阅读 · 2012年12月31日

基于粘滑原理的球形检测机器人原地转向运动时变滑模控制方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于HHT的超光谱图像高精度分类算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

ICF中高能电子和离子输运的Monte-Carlo算法研究和程序研制

国家自然科学基金

0+阅读 · 2009年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员