标准准牛顿方法的非空药性超线性极线性统一 (Non-asymptotic Superlinear Convergence of Standard Quasi-Newton Methods) - 专知论文

会员服务 ·

0

DFP · 拟牛顿法 · BFGS · Lipschitz连续 · 目标函数 ·

2021 年 6 月 29 日

Non-asymptotic Superlinear Convergence of Standard Quasi-Newton Methods

翻译：标准准牛顿方法的非空药性超线性极线性统一

Qiujiang Jin,Aryan Mokhtari

In this paper, we study and prove the non-asymptotic superlinear convergence rate of the Broyden class of quasi-Newton methods including Davidon--Fletcher--Powell (DFP) method and Broyden--Fletcher--Goldfarb--Shanno (BFGS) method. The asymptotic superlinear convergence rate of these quasi-Newton methods has been extensively studied, but their explicit finite time local convergence rate is not fully investigated. In this paper, we provide a finite time (non-asymptotic) convergence analysis for BFGS and DFP methods under the assumptions that the objective function is strongly convex, its gradient is Lipschitz continuous, and its Hessian is Lipschitz continuous only in the direction of the optimal solution. We show that in a local neighborhood of the optimal solution, the iterates generated by both DFP and BFGS converge to the optimal solution at a superlinear rate of $(1/k)^{k/2}$, where $k$ is the number of iterations. We also prove the same local superlinear convergence rate in the case that the objective function is self-concordant. Numerical experiments on different objective functions confirm our explicit convergence rates. Our theoretical guarantee is one of the first results that provide a non-asymptotic superlinear convergence rate for DFP and BFGS quasi-Newton methods.

翻译：在本文中,我们研究并证明准牛顿方法(包括Davidon-Fletcher-Powell(DFP)方法和Broyden-Fletcher-Goldfarb-Shanno(BFGS)方法)的非表面超线性超线性趋同率。这些准牛顿方法(BFGS)的非表面性超线性超线性趋同率得到了广泛的研究,但并未充分调查这些方法的明确有限时间当地趋同率。在本文中,我们为BFGS和DFP方法提供了有限的时间(非表面性)趋同率分析,其假设是:目标功能是很强的 convex,其梯度是Lipschitz-Fletcher-Goldfarb-Shanno(Goldforforb-Shanno)方法,以及BSHiscitzt(Lipschitz)方法只是朝着最佳解决办法的方向持续走下去。我们显示,在最佳解决办法的当地附近地区,DFP和BFGS产生的超线性超线性超线性超线性超线性趋同率率($k) ) 和最优化解决办法最接近最佳解决办法与最佳解决办法相趋同于最接近的融合率。在美元($1/Link-col-col-col-col-col-colental-col-col-colental) 的精确率中,其中, 和我们的不相趋同为一等的理论性试验率。

0

相关内容

DFP

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【经典书】线性代数，Linear Algebra，525页pdf

【经典书】线性代数，Linear Algebra，525页pdf

专知会员服务

78+阅读 · 2021年1月29日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

58+阅读 · 2020年11月21日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【经典图书】机器学习基础，427页pdf Foundations of machine learning

【经典图书】机器学习基础，427页pdf Foundations of machine learning

专知会员服务

158+阅读 · 2019年11月14日

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

专知会员服务

85+阅读 · 2019年10月29日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

《自然》（20190829出版）一周论文导读

《自然》（20190829出版）一周论文导读

科学网

6+阅读 · 2019年8月30日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

最佳实践：深度学习用于自然语言处理（三）

最佳实践：深度学习用于自然语言处理（三）

待字闺中

3+阅读 · 2017年8月20日

Quasi-random words and limits of word sequences

Arxiv

0+阅读 · 2021年8月31日

Uniform Convergence, Adversarial Spheres and a Simple Remedy

Arxiv

0+阅读 · 2021年8月30日

A priori error analysis of high-order LL* (FOSLL*) finite element methods

Arxiv

0+阅读 · 2021年8月30日

Optimizing tree decompositions in MSO

Arxiv

0+阅读 · 2021年8月30日

Calculus of the exponent of Kurdyka-Łojasiewicz inequality and its applications to linear convergence of first-order methods

Arxiv

0+阅读 · 2021年8月30日

Bilevel Optimization: Convergence Analysis and Enhanced Design

Bilevel Optimization: Convergence Analysis and Enhanced Design

Arxiv

0+阅读 · 2021年8月27日

Lower Bounds and Accelerated Algorithms for Bilevel Optimization

Arxiv

0+阅读 · 2021年8月27日

On the nonlinear Dirichlet-Neumann method and preconditioner for Newton's method

Arxiv

0+阅读 · 2021年8月26日

Adaptive and Universal Algorithms for Variational Inequalities with Optimal Convergence

Arxiv

0+阅读 · 2021年8月26日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

VIP会员

文章信息

相关主题

Lipschitz连续

相关VIP内容

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

【经典书】线性代数，Linear Algebra，525页pdf

【经典书】线性代数，Linear Algebra，525页pdf

专知会员服务

78+阅读 · 2021年1月29日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

【经典书】应用随机微分方程，324页pdf，Applied Stochastic Differential Equations

专知会员服务

58+阅读 · 2020年11月21日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

【经典图书】机器学习基础，427页pdf Foundations of machine learning

【经典图书】机器学习基础，427页pdf Foundations of machine learning

专知会员服务

158+阅读 · 2019年11月14日

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

【课程】普林斯顿大学19年春季学期《机器学习优化》课程讲义

专知会员服务

85+阅读 · 2019年10月29日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

《自然》（20190829出版）一周论文导读

《自然》（20190829出版）一周论文导读

科学网

6+阅读 · 2019年8月30日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Adversarial Variational Bayes: Unifying VAE and GAN 代码

Adversarial Variational Bayes: Unifying VAE and GAN 代码

CreateAMind

7+阅读 · 2017年10月4日

最佳实践：深度学习用于自然语言处理（三）

最佳实践：深度学习用于自然语言处理（三）

待字闺中

3+阅读 · 2017年8月20日

相关论文

Quasi-random words and limits of word sequences

Arxiv

0+阅读 · 2021年8月31日

Uniform Convergence, Adversarial Spheres and a Simple Remedy

Arxiv

0+阅读 · 2021年8月30日

A priori error analysis of high-order LL* (FOSLL*) finite element methods

Arxiv

0+阅读 · 2021年8月30日

Optimizing tree decompositions in MSO

Arxiv

0+阅读 · 2021年8月30日

Calculus of the exponent of Kurdyka-Łojasiewicz inequality and its applications to linear convergence of first-order methods

Arxiv

0+阅读 · 2021年8月30日

Bilevel Optimization: Convergence Analysis and Enhanced Design

Bilevel Optimization: Convergence Analysis and Enhanced Design

Arxiv

0+阅读 · 2021年8月27日

Lower Bounds and Accelerated Algorithms for Bilevel Optimization

Arxiv

0+阅读 · 2021年8月27日

On the nonlinear Dirichlet-Neumann method and preconditioner for Newton's method

Arxiv

0+阅读 · 2021年8月26日

Adaptive and Universal Algorithms for Variational Inequalities with Optimal Convergence

Arxiv

0+阅读 · 2021年8月26日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

微信扫码咨询专知VIP会员