超过网络的牛顿方法快速到统计精度 (Newton Method over Networks is Fast up to the Statistical Precision) - 专知论文

会员服务 ·

0

统计量 · Networking · 查准率/准确率 · 经验风险最小化 · 经验风险 ·

2021 年 2 月 12 日

Newton Method over Networks is Fast up to the Statistical Precision

翻译：超过网络的牛顿方法快速到统计精度

Amir Daneshmand,Gesualdo Scutari,Pavel Dvurechensky,Alexander Gasnikov

We propose a distributed cubic regularization of the Newton method for solving (constrained) empirical risk minimization problems over a network of agents, modeled as undirected graph. The algorithm employs an inexact, preconditioned Newton step at each agent's side: the gradient of the centralized loss is iteratively estimated via a gradient-tracking consensus mechanism and the Hessian is subsampled over the local data sets. No Hessian matrices are thus exchanged over the network. We derive global complexity bounds for convex and strongly convex losses. Our analysis reveals an interesting interplay between sample and iteration/communication complexity: statistically accurate solutions are achievable in roughly the same number of iterations of the centralized cubic Newton method, with a communication cost per iteration of the order of $\widetilde{\mathcal{O}}\big(1/\sqrt{1-\rho}\big)$, where $\rho$ characterizes the connectivity of the network. This demonstrates a significant communication saving with respect to that of existing, statistically oblivious, distributed Newton-based methods over networks.

翻译：我们建议对牛顿解决(受限制的)实证风险最小化问题的方法进行分布式的立方正规化,在代理商的网络上,以未定向的图表为模型。算法在每个代理商的侧面使用不精确的、有先决条件的牛顿步骤:集中损失的梯度通过梯度跟踪共识机制进行迭代估计,赫森对本地数据集进行子取样。因此,网络上没有交换赫森基质。我们从全球复杂度中得出锥形和强烈锥形损失。我们的分析揭示了抽样和迭代/通信复杂性之间的令人感兴趣的相互作用:在集中的牛顿立方法的迭代数量上,统计准确的解决方案大致可以实现,按美元全方位的顺序来计算通信成本。O ⁇ big(1/\qrt{1-\rho ⁇ big),其中美元是网络连接的特征。这显示了现有、统计上遗忘的、基于纽顿的基于网络的方法在网络上的巨大通信节约。

0

相关内容

统计量

【实用书】数据科学基础，484页pdf，Foundations of Data Science

【实用书】数据科学基础，484页pdf，Foundations of Data Science

专知会员服务

122+阅读 · 2020年5月28日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

专知会员服务

85+阅读 · 2020年2月16日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【牛津大学Yee Whye Teh 】论深度学习中的统计思维（On Statistical Thinking in Deep Learning），附49页ppt

【牛津大学Yee Whye Teh 】论深度学习中的统计思维（On Statistical Thinking in Deep Learning），附49页ppt

专知会员服务

62+阅读 · 2019年11月24日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

992页《初等微积分：无穷小方法》(Elementary Calculus. An Infinitesimal Approach)书籍【附下载】

992页《初等微积分：无穷小方法》(Elementary Calculus. An Infinitesimal Approach)书籍【附下载】

专知会员服务

26+阅读 · 2019年10月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

深度学习优化算法入门：二、动量、RMSProp、Adam

深度学习优化算法入门：二、动量、RMSProp、Adam

论智

10+阅读 · 2018年10月2日

误差反向传播——RNN

误差反向传播——RNN

统计学习与视觉计算组

18+阅读 · 2018年9月6日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

A global method for mixed categorical optimization with catalogs

Arxiv

0+阅读 · 2021年4月8日

High order asymptotic preserving Hermite WENO fast sweeping method for the steady-state $S_{N}$ transport equation

Arxiv

0+阅读 · 2021年4月8日

Zeta Correction: A New Approach to Constructing Corrected Trapezoidal Quadrature Rules for Singular Integral Operators

Arxiv

0+阅读 · 2021年4月7日

Accelerated derivative-free nonlinear least-squares applied to the estimation of Manning coefficients

Arxiv

0+阅读 · 2021年4月6日

Accelerated Gradient Tracking over Time-varying Graphs for Decentralized Optimization

Arxiv

0+阅读 · 2021年4月6日

Statistical Network Analysis with Bergm

Arxiv

0+阅读 · 2021年4月6日

On the Optimality of Batch Policy Optimization Algorithms

Arxiv

0+阅读 · 2021年4月6日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

Distributed Graph Convolutional Networks

Arxiv

19+阅读 · 2020年7月13日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

VIP会员

文章信息

相关主题

查准率/准确率

经验风险最小化

相关VIP内容

【实用书】数据科学基础，484页pdf，Foundations of Data Science

【实用书】数据科学基础，484页pdf，Foundations of Data Science

专知会员服务

122+阅读 · 2020年5月28日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

统计学习理论之父Vapnik-MIT2020报告《完全学习统计理论Statistical Theory of Learning》

专知会员服务

85+阅读 · 2020年2月16日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

【牛津大学Yee Whye Teh 】论深度学习中的统计思维（On Statistical Thinking in Deep Learning），附49页ppt

【牛津大学Yee Whye Teh 】论深度学习中的统计思维（On Statistical Thinking in Deep Learning），附49页ppt

专知会员服务

62+阅读 · 2019年11月24日

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

【北京智源大会2019】神经网络的优化Optimization for Overparametrized Deep Neural Networks，北京大学 | 王立威

专知会员服务

23+阅读 · 2019年11月21日

992页《初等微积分：无穷小方法》(Elementary Calculus. An Infinitesimal Approach)书籍【附下载】

992页《初等微积分：无穷小方法》(Elementary Calculus. An Infinitesimal Approach)书籍【附下载】

专知会员服务

26+阅读 · 2019年10月28日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

《商用大语言模型的升级风险管理：国家安全运用》

自主人工智能：未来战争是否将是自主化的？

《从装备到文化：美陆军技术素养建设启示录》最新报告

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

深度学习优化算法入门：二、动量、RMSProp、Adam

深度学习优化算法入门：二、动量、RMSProp、Adam

论智

10+阅读 · 2018年10月2日

误差反向传播——RNN

误差反向传播——RNN

统计学习与视觉计算组

18+阅读 · 2018年9月6日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

【论文推荐】最新六篇主题模型相关论文—收敛率、大规模、深度主题建模、优化、情绪强度、广义动态主题模型

专知

11+阅读 · 2018年3月29日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

A global method for mixed categorical optimization with catalogs

Arxiv

0+阅读 · 2021年4月8日

High order asymptotic preserving Hermite WENO fast sweeping method for the steady-state $S_{N}$ transport equation

Arxiv

0+阅读 · 2021年4月8日

Zeta Correction: A New Approach to Constructing Corrected Trapezoidal Quadrature Rules for Singular Integral Operators

Arxiv

0+阅读 · 2021年4月7日

Accelerated derivative-free nonlinear least-squares applied to the estimation of Manning coefficients

Arxiv

0+阅读 · 2021年4月6日

Accelerated Gradient Tracking over Time-varying Graphs for Decentralized Optimization

Arxiv

0+阅读 · 2021年4月6日

Statistical Network Analysis with Bergm

Arxiv

0+阅读 · 2021年4月6日

On the Optimality of Batch Policy Optimization Algorithms

Arxiv

0+阅读 · 2021年4月6日

Deep learning: a statistical viewpoint

Arxiv

18+阅读 · 2021年3月16日

Distributed Graph Convolutional Networks

Arxiv

19+阅读 · 2020年7月13日

Learning to Propagate for Graph Meta-Learning

Arxiv

14+阅读 · 2019年9月11日

微信扫码咨询专知VIP会员