Most machine learning methods can be regarded as the minimization of an unavailable risk function. To optimize this function from samples provided in a streaming fashion, we define a general stochastic Newton algorithm and its weighted averaged version. We show that, in several use cases, both implementations avoid inverting a Hessian estimate at each iteration by instead updating the estimate of the inverse Hessian directly; this generalizes a trick introduced in [2] for the specific case of logistic regression. Under mild assumptions, such as local strong convexity at the optimum, we establish almost sure convergence and rates of convergence of the algorithms, as well as central limit theorems for the resulting parameter estimates. The unified framework considered in this paper covers linear, logistic, and softmax regression, among others. Numerical experiments on simulated data provide empirical evidence of the relevance of the proposed methods, which outperform popular competitors, particularly in the case of poor initializations.
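To make the direct inverse-Hessian update concrete, the following is a minimal sketch for the logistic-regression case, where each per-sample Hessian contribution is rank one, so the inverse of the cumulative Hessian estimate can be refreshed with a Sherman–Morrison rank-one identity and no matrix is ever inverted during the iterations. This is an illustrative reconstruction, not the paper's implementation: the function name `stochastic_newton_logistic`, the ridge-style initialization `lam`, and the synthetic data stream are all assumptions made for the example.

```python
import numpy as np

def sigmoid(z):
    # Clip for numerical stability in the sketch (illustrative choice).
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30.0, 30.0)))

def stochastic_newton_logistic(stream, dim, n_steps, lam=1.0):
    """Stochastic Newton iteration for logistic regression that maintains
    the inverse of the cumulative Hessian estimate directly, so no matrix
    inversion is performed at any iteration (hypothetical sketch)."""
    theta = np.zeros(dim)          # parameter estimate
    S_inv = np.eye(dim) / lam      # inverse of the cumulative Hessian estimate S_n
    for _ in range(n_steps):
        x, y = next(stream)        # streaming sample: features x, label y in {0, 1}
        p = sigmoid(x @ theta)
        grad = (p - y) * x         # per-sample gradient of the logistic loss
        # Rank-one update S_n = S_{n-1} + a x x^T with a = p(1 - p);
        # Sherman-Morrison yields S_n^{-1} from S_{n-1}^{-1} without inversion:
        a = p * (1.0 - p)
        Sx = S_inv @ x
        S_inv -= (a / (1.0 + a * (x @ Sx))) * np.outer(Sx, Sx)
        # Newton step; since S_n grows like n, S_n^{-1} grad carries an
        # implicit 1/n step size.
        theta -= S_inv @ grad
    return theta

# Example usage on synthetic data (illustrative only).
rng = np.random.default_rng(0)
theta_true = np.array([1.0, -2.0, 0.5])

def sample_stream():
    while True:
        x = rng.normal(size=3)
        yield x, rng.binomial(1, sigmoid(x @ theta_true))

theta_hat = stochastic_newton_logistic(sample_stream(), dim=3, n_steps=20000)
```

Each iteration costs O(d^2) for the rank-one update, versus O(d^3) if a Hessian estimate were re-inverted at every step, which is the computational point of updating the inverse directly.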