We propose a distributed cubic regularization of the Newton method for solving (constrained) empirical risk minimization problems over a network of agents, modeled as an undirected graph. The algorithm employs an inexact, preconditioned Newton step at each agent's side: the gradient of the centralized loss is iteratively estimated via a gradient-tracking consensus mechanism, and the Hessian is subsampled over the local data sets. No Hessian matrices are thus exchanged over the network. We derive global complexity bounds for convex and strongly convex losses. Our analysis reveals an interesting interplay between sample and iteration/communication complexity: statistically accurate solutions are achievable in roughly the same number of iterations as the centralized cubic Newton method, with a communication cost per iteration of the order of $\widetilde{\mathcal{O}}\big(1/\sqrt{1-\rho}\big)$, where $\rho$ characterizes the connectivity of the network. This demonstrates a significant communication saving with respect to existing, statistically oblivious, distributed Newton-based methods over networks.
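A minimal sketch of the two mechanisms the abstract names, gradient-tracking estimation of the network-average gradient and a locally solved cubic-regularized Newton subproblem, on synthetic quadratic losses over a ring network. This is an illustration, not the paper's algorithm: the data, the Metropolis mixing matrix, the damping factor `alpha`, and the cubic constant `M` are all hypothetical choices, and the local Hessian `A[i]` stands in for a subsampled Hessian.

```python
import numpy as np

rng = np.random.default_rng(0)
m, d = 5, 3                                   # agents, problem dimension
# Hypothetical local quadratic losses f_i(x) = 0.5 x'A_i x - b_i'x, with
# similar Hessians (mimicking statistically similar local data sets).
A, b = [], []
for _ in range(m):
    Q = rng.standard_normal((d, d))
    A.append(np.eye(d) + 0.1 * Q @ Q.T)
    b.append(rng.standard_normal(d))

def grad(i, xi):
    return A[i] @ xi - b[i]

# Metropolis mixing matrix for a ring graph (symmetric, doubly stochastic).
W = np.zeros((m, m))
for i in range(m):
    W[i, i] = W[i, (i - 1) % m] = W[i, (i + 1) % m] = 1.0 / 3.0

def cubic_step(H, g, M=1.0, bisections=60):
    """Minimize g's + 0.5 s'Hs + (M/6)||s||^3 for H > 0.

    The minimizer satisfies (H + (M/2)||s|| I) s = -g; since ||s(r)||
    is decreasing in r, the fixed point ||s|| = r is found by bisection.
    """
    lo, hi = 0.0, 1e6
    s = np.zeros_like(g)
    for _ in range(bisections):
        r = 0.5 * (lo + hi)
        s = np.linalg.solve(H + 0.5 * M * r * np.eye(len(g)), -g)
        if np.linalg.norm(s) > r:
            lo = r
        else:
            hi = r
    return s

alpha = 0.2                                    # damped local step, for stability
x = np.zeros((m, d))                           # one iterate per agent
y = np.array([grad(i, x[i]) for i in range(m)])  # gradient trackers
for _ in range(400):
    g_old = np.array([grad(i, x[i]) for i in range(m)])
    # Consensus on the iterates plus a local cubic-Newton correction;
    # only x and y (vectors, never Hessians) would travel over the network.
    x = W @ x + alpha * np.array([cubic_step(A[i], y[i]) for i in range(m)])
    g_new = np.array([grad(i, x[i]) for i in range(m)])
    y = W @ y + g_new - g_old                  # track the average gradient

x_star = np.linalg.solve(sum(A), sum(b))       # centralized minimizer
```

After the loop every agent's iterate agrees with the minimizer of the sum of the local losses, even though no agent ever sees another agent's data or Hessian.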