Convergence Rates of Stochastic Zeroth-order Gradient Descent for Łojasiewicz Functions


April 19, 2023


Tianyu Wang, Yasong Feng

We prove convergence rates of Stochastic Zeroth-order Gradient Descent (SZGD) algorithms for Łojasiewicz functions. The SZGD algorithm iterates as
\begin{align*}
\mathbf{x}_{t+1} = \mathbf{x}_t - \eta_t \widehat{\nabla} f (\mathbf{x}_t), \qquad t = 0,1,2,3,\cdots ,
\end{align*}
where $f$ is the objective function that satisfies the Łojasiewicz inequality with Łojasiewicz exponent $\theta$, $\eta_t$ is the step size (learning rate), and $\widehat{\nabla} f (\mathbf{x}_t)$ is the approximate gradient estimated using zeroth-order information only. Our results show that $\{ f (\mathbf{x}_t) - f (\mathbf{x}_\infty) \}_{t \in \mathbb{N}}$ can converge faster than $\{ \| \mathbf{x}_t - \mathbf{x}_\infty \| \}_{t \in \mathbb{N}}$, regardless of whether the objective $f$ is smooth or nonsmooth.
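The iteration above can be illustrated with a minimal Python sketch. The abstract does not specify how $\widehat{\nabla} f$ is built, so the two-point random-direction estimator below is an assumption chosen for illustration, not necessarily the estimator analyzed by the authors. The function names, the constant step size, the finite-difference radius `delta`, and the test objective $f(\mathbf{x}) = \|\mathbf{x}\|^2$ are likewise hypothetical. That objective satisfies a Łojasiewicz inequality of the common form $|f(\mathbf{x}) - f(\mathbf{x}^*)|^{\theta} \le C \|\nabla f(\mathbf{x})\|$ with $\theta = 1/2$.

```python
import math
import random


def zeroth_order_gradient(f, x, delta, rng):
    """Two-point gradient estimate of f at x along one random direction.

    Illustrative estimator (an assumption): it only evaluates f, never its gradient.
    """
    n = len(x)
    # Draw a direction uniformly on the unit sphere by normalizing a Gaussian vector.
    g = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(gi * gi for gi in g))
    u = [gi / norm for gi in g]
    x_plus = [xi + delta * ui for xi, ui in zip(x, u)]
    x_minus = [xi - delta * ui for xi, ui in zip(x, u)]
    # Central difference approximates the directional derivative along u.
    slope = (f(x_plus) - f(x_minus)) / (2.0 * delta)
    # The factor n compensates for E[u u^T] = I / n.
    return [n * slope * ui for ui in u]


def szgd(f, x0, step_size, delta, num_steps, rng):
    """Run x_{t+1} = x_t - eta * grad_est(x_t) with a constant step size eta."""
    x = list(x0)
    iterates = [list(x)]
    for _ in range(num_steps):
        grad_est = zeroth_order_gradient(f, x, delta, rng)
        x = [xi - step_size * gi for xi, gi in zip(x, grad_est)]
        iterates.append(list(x))
    return iterates


def squared_norm(x):
    """Toy objective f(x) = ||x||^2, minimized at the origin."""
    return sum(xi * xi for xi in x)


if __name__ == "__main__":
    rng = random.Random(0)
    iterates = szgd(squared_norm, [1.0] * 5, step_size=0.05,
                    delta=1e-3, num_steps=200, rng=rng)
    for t in (0, 50, 100, 200):
        x_t = iterates[t]
        gap = squared_norm(x_t)            # f(x_t) - f(x_inf), with f(x_inf) = 0
        dist = math.sqrt(squared_norm(x_t))  # ||x_t - x_inf||, with x_inf = 0
        print(t, gap, dist)
```

For this toy quadratic the function-value gap is the square of the distance to the minimizer, so it shrinks faster than the distance itself. This only loosely echoes the abstract's claim and is not a substitute for the paper's analysis, which covers general Łojasiewicz functions.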

