通讯效率地方权力下放的SGD方法 (Communication-Efficient Local Decentralized SGD Methods) - 专知论文

会员服务 ·

0

SGD · 周期的 · 相互独立的 · 评论员 · 最优化 ·

2021 年 4 月 5 日

Communication-Efficient Local Decentralized SGD Methods

翻译：通讯效率地方权力下放的SGD方法

Xiang Li,Wenhao Yang,Shusen Wang,Zhihua Zhang

Recently, the technique of local updates is a powerful tool in centralized settings to improve communication efficiency via periodical communication. For decentralized settings, it is still unclear how to efficiently combine local updates and decentralized communication. In this work, we propose an algorithm named as LD-SGD, which incorporates arbitrary update schemes that alternate between multiple Local updates and multiple Decentralized SGDs, and provide an analytical framework for LD-SGD. Under the framework, we present a sufficient condition to guarantee the convergence. We show that LD-SGD converges to a critical point for a wide range of update schemes when the objective is non-convex and the training data are non-identically independent distributed. Moreover, our framework brings many insights into the design of update schemes for decentralized optimization. As examples, we specify two update schemes and show how they help improve communication efficiency. Specifically, the first scheme alternates the number of local and global update steps. From our analysis, the ratio of the number of local updates to that of decentralized SGD trades off communication and computation. The second scheme is to periodically shrink the length of local updates. We show that the decaying strategy helps improve communication efficiency both theoretically and empirically.

翻译：最近,地方更新技术是中央环境中通过定期通信提高通信效率的有力工具。对于分散化环境,仍然不清楚如何有效地将地方更新与分散化通信结合起来。在这项工作中,我们提议了一个称为LD-SGD的算法,它包含多种地方更新和多分散化的 SGD 之间的任意更新计划,为LD-SGD 提供了一个分析框架。在这个框架内,我们提出了一个足够的条件来保证统一。我们显示,LD-SGD在目标为非混凝土和训练数据不明显独立分布的情况下,会汇合到一系列广泛的更新计划的关键点。此外,我们的框架为设计权力下放优化的更新计划提供了许多见解。作为例子,我们指定了两个更新计划,并展示了它们如何帮助提高通信效率。具体地说,第一个计划替代了地方和全球更新步骤的数量。我们的分析表明,地方更新的数量与分散化的SGD交易在通信和计算上的比例。第二个计划是定期缩短当地更新的时间长度。我们表明,腐蚀的战略有助于提高理论上和实践中的沟通效率。

0

相关内容

SGD

【WWW2021 】洛伦兹图卷积神经网络

专知会员服务

44+阅读 · 2021年5月26日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

元学习(meta learning) 最新进展综述论文

元学习(meta learning) 最新进展综述论文

专知会员服务

281+阅读 · 2020年5月8日

【2020关键词提取】使用多个本地功能从单个文档中提取关键字，YAKE! Keyword extraction from single documents using multiple local features

【2020关键词提取】使用多个本地功能从单个文档中提取关键字，YAKE! Keyword extraction from single documents using multiple local features

专知会员服务

26+阅读 · 2020年5月2日

【综述】超参数优化:算法和应用综述，Hyper-Parameter Optimization: A Review of Algorithms and Applications

【综述】超参数优化:算法和应用综述，Hyper-Parameter Optimization: A Review of Algorithms and Applications

专知会员服务

57+阅读 · 2020年3月13日

【ICML2020提交论文】Learning@home:众包与分散Mixture-of-Experts训练的神经网络（Learning@home: Crowdsourced Training of Large Neural Networks with Decentralized Mixture-of-Experts）

【ICML2020提交论文】Learning@home:众包与分散Mixture-of-Experts训练的神经网络（Learning@home: Crowdsourced Training of Large Neural Networks with Decentralized Mixture-of-Experts）

专知会员服务

10+阅读 · 2020年2月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

已删除

将门创投

7+阅读 · 2018年4月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization

Arxiv

0+阅读 · 2021年5月31日

Energy Efficiency Optimization for Multi-cell Massive MIMO: Centralized and Distributed Power Allocation Algorithms

Arxiv

0+阅读 · 2021年5月31日

Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization

Arxiv

0+阅读 · 2021年5月31日

Blockchain-Based Decentralized Energy Management Platform for Residential Distributed Energy Resources in A Virtual Power Plant

Arxiv

0+阅读 · 2021年5月31日

PPT: A Privacy-Preserving Global Model Training Protocol for Federated Learning in P2P Networks

Arxiv

0+阅读 · 2021年5月30日

Trade-offs in Decentralized Multi-Antenna Architectures: The WAX Decomposition

Arxiv

0+阅读 · 2021年5月29日

BE-RAN: Blockchain-enabled Open RAN with Decentralized Identity Management and Privacy-Preserving Communication

Arxiv

0+阅读 · 2021年5月29日

Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference

Arxiv

0+阅读 · 2021年5月28日

Asynchronous Byzantine Machine Learning (the case of SGD)

Arxiv

3+阅读 · 2018年7月9日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

【WWW2021 】洛伦兹图卷积神经网络

专知会员服务

44+阅读 · 2021年5月26日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

【ICML2020】噪声在随机梯度下降中的泛化效益，On the Generalization Benefit of Noise in Stochastic Gradient Descent

专知会员服务

19+阅读 · 2020年6月29日

元学习(meta learning) 最新进展综述论文

元学习(meta learning) 最新进展综述论文

专知会员服务

281+阅读 · 2020年5月8日

【2020关键词提取】使用多个本地功能从单个文档中提取关键字，YAKE! Keyword extraction from single documents using multiple local features

【2020关键词提取】使用多个本地功能从单个文档中提取关键字，YAKE! Keyword extraction from single documents using multiple local features

专知会员服务

26+阅读 · 2020年5月2日

【综述】超参数优化:算法和应用综述，Hyper-Parameter Optimization: A Review of Algorithms and Applications

【综述】超参数优化:算法和应用综述，Hyper-Parameter Optimization: A Review of Algorithms and Applications

专知会员服务

57+阅读 · 2020年3月13日

【ICML2020提交论文】Learning@home:众包与分散Mixture-of-Experts训练的神经网络（Learning@home: Crowdsourced Training of Large Neural Networks with Decentralized Mixture-of-Experts）

【ICML2020提交论文】Learning@home:众包与分散Mixture-of-Experts训练的神经网络（Learning@home: Crowdsourced Training of Large Neural Networks with Decentralized Mixture-of-Experts）

专知会员服务

10+阅读 · 2020年2月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《面向小型无人机或无人飞行器的创新雷达探测与人工智能分类技术》263页

在无标注条件下适配视觉—语言模型：全面综述

《美空军条令出版物：战略打击》最新条令

《高能激光武器》22页slides

相关资讯

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

19篇ICML2019论文摘录选读！

19篇ICML2019论文摘录选读！

专知

28+阅读 · 2019年4月28日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

已删除

将门创投

7+阅读 · 2018年4月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization

Arxiv

0+阅读 · 2021年5月31日

Energy Efficiency Optimization for Multi-cell Massive MIMO: Centralized and Distributed Power Allocation Algorithms

Arxiv

0+阅读 · 2021年5月31日

Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization

Arxiv

0+阅读 · 2021年5月31日

Blockchain-Based Decentralized Energy Management Platform for Residential Distributed Energy Resources in A Virtual Power Plant

Arxiv

0+阅读 · 2021年5月31日

PPT: A Privacy-Preserving Global Model Training Protocol for Federated Learning in P2P Networks

Arxiv

0+阅读 · 2021年5月30日

Trade-offs in Decentralized Multi-Antenna Architectures: The WAX Decomposition

Arxiv

0+阅读 · 2021年5月29日

BE-RAN: Blockchain-enabled Open RAN with Decentralized Identity Management and Privacy-Preserving Communication

Arxiv

0+阅读 · 2021年5月29日

Optimal Model Placement and Online Model Splitting for Device-Edge Co-Inference

Arxiv

0+阅读 · 2021年5月28日

Asynchronous Byzantine Machine Learning (the case of SGD)

Arxiv

3+阅读 · 2018年7月9日

Optimal Algorithms for Non-Smooth Distributed Optimization in Networks

Arxiv

7+阅读 · 2018年6月1日

微信扫码咨询专知VIP会员