A central question in federated learning (FL) is how to design optimization algorithms that minimize the communication cost of training a model over heterogeneous data distributed across many clients. A popular technique for reducing communication is the use of local steps, where clients take multiple optimization steps over local data before communicating with the server (e.g., FedAvg, SCAFFOLD). This contrasts with centralized methods, where clients take a single optimization step per communication round (e.g., Minibatch SGD). A recent lower bound on the communication complexity of first-order methods shows that centralized methods are optimal over highly heterogeneous data, whereas local methods are optimal over purely homogeneous data [Woodworth et al., 2020]. For intermediate heterogeneity levels, no algorithm is known to match the lower bound. In this paper, we propose a multistage optimization scheme that nearly matches the lower bound across all heterogeneity levels. The idea is to first run a local method until it reaches a heterogeneity-induced error floor, and then switch to a centralized method for the remaining rounds. Our analysis may help explain empirically successful stepsize decay methods in FL [Charles et al., 2020; Reddi et al., 2020]. We demonstrate the scheme's practical utility in image classification tasks.
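To make the two-phase idea concrete, the following is a minimal sketch, not the paper's exact algorithm: a FedAvg-style local phase runs for the first rounds, after which training switches to a Minibatch-SGD-style centralized phase. The quadratic client objectives, the stepsize `lr`, the number of local steps, and the switching round `T_switch` are all illustrative assumptions.

```python
# Minimal sketch of a local-then-centralized multistage scheme.
# All problem data and hyperparameters below are illustrative, not from the paper.
import numpy as np

rng = np.random.default_rng(0)
num_clients, dim = 10, 5

# Hypothetical heterogeneous client objectives: f_i(x) = 0.5 * ||x - b_i||^2,
# where the spread of the b_i plays the role of data heterogeneity.
b = rng.normal(size=(num_clients, dim))
grad = lambda x, i: x - b[i]  # gradient of f_i at x

x = np.zeros(dim)
lr, local_steps, T_switch, T_total = 0.1, 5, 50, 100

for t in range(T_total):
    if t < T_switch:
        # Phase 1 (local method): each client takes several local gradient
        # steps, and the server averages the resulting iterates (FedAvg-style).
        updates = []
        for i in range(num_clients):
            x_i = x.copy()
            for _ in range(local_steps):
                x_i -= lr * grad(x_i, i)
            updates.append(x_i)
        x = np.mean(updates, axis=0)
    else:
        # Phase 2 (centralized method): one step per round using the average
        # of the clients' gradients at the current iterate (Minibatch SGD-style).
        g = np.mean([grad(x, i) for i in range(num_clients)], axis=0)
        x -= lr * g

print("final iterate: ", x)
print("global optimum:", b.mean(axis=0))  # minimizer of the average objective
```

In this sketch the switch point `T_switch` is fixed in advance; in practice one would switch once the local phase stops making progress, i.e., at the heterogeneity-induced error floor described above.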