DRSOM: A Dimension Reduced Second-Order Method - Zhuanzhi (专知) Papers

Reducible · Performer · Learning · Subspace · Krylov methods

January 2, 2023

DRSOM: A Dimension Reduced Second-Order Method

Chuwen Zhang, Dongdong Ge, Chang He, Bo Jiang, Yuntian Jiang, Yinyu Ye

From arXiv. Considerable changes in the main text. 31 pages.

In this paper, we propose a Dimension-Reduced Second-Order Method (DRSOM) for convex and nonconvex (unconstrained) optimization. Under a trust-region-like framework, our method preserves the convergence of second-order methods while using curvature information in only a few directions. Consequently, the computational overhead of our method remains comparable to that of first-order methods such as gradient descent. Theoretically, we show that the method has local quadratic convergence and a global convergence rate of $O(\epsilon^{-3/2})$ to satisfy the first-order and second-order conditions, provided the subspace satisfies a commonly adopted approximated Hessian assumption. We further show that this assumption can be removed if we periodically perform one corrector step (using a Krylov method, for example) at the end stage of the algorithm. The applicability and performance of DRSOM are demonstrated by various computational experiments, particularly in machine learning and deep learning. For neural networks, our preliminary implementation seems to gain computational advantages in terms of training accuracy and iteration complexity over state-of-the-art first-order methods such as SGD and ADAM.
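The idea of using curvature in only a few directions inside a trust-region-like loop can be illustrated with a small sketch. It is not the authors' implementation, and several choices are assumptions made for illustration: the subspace is spanned by the negative gradient and the previous accepted step, Hessian-vector products are estimated by central differences of the gradient, and the trust-region subproblem is replaced by a simpler regularized solve of `(Q + lam * G) alpha = -c` whose weight `lam` is adapted with a ratio test. All names (`drsom_sketch`, `hess_vec`, `solve_small`, `lam`) are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def add_scaled(x, t, v):
    """Return x + t * v for plain Python lists."""
    return [xi + t * vi for xi, vi in zip(x, v)]

def unit(v):
    """Return v scaled to unit Euclidean norm."""
    n = math.sqrt(dot(v, v))
    return [vi / n for vi in v]

def hess_vec(grad, x, u, h=1e-6):
    """Central-difference estimate of the Hessian-vector product H(x) u."""
    gp, gm = grad(add_scaled(x, h, u)), grad(add_scaled(x, -h, u))
    return [(p - m) / (2.0 * h) for p, m in zip(gp, gm)]

def solve_small(A, b):
    """Solve a 1x1 or 2x2 system; return None if it is numerically singular."""
    if len(b) == 1:
        return [b[0] / A[0][0]] if A[0][0] != 0.0 else None
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    if abs(det) <= 1e-12 * abs(A[0][0] * A[1][1]):
        return None
    return [(b[0] * A[1][1] - b[1] * A[0][1]) / det,
            (A[0][0] * b[1] - A[1][0] * b[0]) / det]

def drsom_sketch(f, grad, x0, iters=100, lam=1e-3, tol=1e-10):
    """Hypothetical dimension-reduced second-order loop (illustration only)."""
    x, d_prev = list(x0), None
    history = [f(x)]                      # objective values at accepted iterates
    for _ in range(iters):
        g = grad(x)
        if math.sqrt(dot(g, g)) < tol:
            break
        # Assumed subspace: negative gradient plus the previous accepted step.
        dirs = [unit([-gi for gi in g])]
        if d_prev is not None:
            dirs.append(unit(d_prev))
        k = len(dirs)
        Hd = [hess_vec(grad, x, u) for u in dirs]   # curvature in k directions only
        Q = [[dot(dirs[i], Hd[j]) for j in range(k)] for i in range(k)]
        G = [[dot(dirs[i], dirs[j]) for j in range(k)] for i in range(k)]
        c = [dot(g, u) for u in dirs]
        # Regularized reduced model: solve (Q + lam * G) alpha = -c.
        A = [[Q[i][j] + lam * G[i][j] for j in range(k)] for i in range(k)]
        alpha = solve_small(A, [-ci for ci in c])
        if alpha is None:                 # degenerate subspace: retry with gradient only
            d_prev, lam = None, 4.0 * lam
            continue
        step = [sum(alpha[i] * dirs[i][n] for i in range(k)) for n in range(len(x))]
        quad = sum(alpha[i] * Q[i][j] * alpha[j] for i in range(k) for j in range(k))
        predicted = -(dot(c, alpha) + 0.5 * quad)   # decrease promised by the model
        x_new = add_scaled(x, 1.0, step)
        f_new = f(x_new)
        actual = history[-1] - f_new                # decrease actually obtained
        if predicted > 0.0 and actual / predicted > 0.1:
            x, d_prev = x_new, step                 # accept the step
            history.append(f_new)
            if actual / predicted > 0.75:
                lam = max(lam / 4.0, 1e-12)         # model is reliable: relax
        else:
            lam *= 4.0                              # reject: shrink the next step
    return x, history

# Usage on a small convex quadratic f(x) = 0.5 * sum(a_i * x_i^2).
a = [1.0, 3.0, 9.0]
f = lambda x: 0.5 * sum(ai * xi * xi for ai, xi in zip(a, x))
grad = lambda x: [ai * xi for ai, xi in zip(a, x)]
x_final, history = drsom_sketch(f, grad, [1.0, 1.0, 1.0])
```

Each iteration costs a handful of gradient evaluations and one linear solve of size at most 2, which is the sense in which the per-iteration overhead stays close to that of a first-order method. The corrector step mentioned in the abstract is not modeled here.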
