Decentralized learning enables a group of collaborative agents to learn models using a distributed dataset without the need for a central parameter server. Recently, decentralized learning algorithms have demonstrated state-of-the-art results on benchmark datasets, comparable to centralized algorithms. However, this competitive performance rests on the key assumption that the data is independently and identically distributed (IID) across the agents, an assumption that often does not hold in real-life applications. Inspired by ideas from continual learning, we propose Cross-Gradient Aggregation (CGA), a novel decentralized learning algorithm in which (i) each agent aggregates cross-gradient information, i.e., derivatives of its model with respect to its neighbors' datasets, and (ii) updates its model using a projected gradient obtained via quadratic programming (QP). We theoretically analyze the convergence characteristics of CGA and demonstrate its efficiency on non-IID data distributions sampled from the MNIST and CIFAR-10 datasets. Our empirical comparisons show that CGA outperforms existing state-of-the-art decentralized learning algorithms, and that this advantage is maintained when the communicated information is compressed to reduce peer-to-peer communication overhead.
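Since CGA draws its projection step from continual learning, the following minimal Python sketch illustrates one plausible form of that step: a GEM-style QP that finds the update direction closest to an agent's local gradient while keeping a non-negative inner product with every cross-gradient received from its neighbors. The function name `project_gradient` and the use of SciPy's SLSQP solver are illustrative assumptions, not the paper's implementation.

```python
import numpy as np
from scipy.optimize import minimize

def project_gradient(local_grad, cross_grads):
    """Hedged sketch of a GEM-style QP projection: return the direction z
    closest to the agent's local gradient such that z does not conflict
    with any neighbor's cross-gradient (i.e., <z, g_j> >= 0 for all j)."""
    # Objective: minimize 0.5 * ||z - g||^2 over candidate directions z
    obj = lambda z: 0.5 * np.sum((z - local_grad) ** 2)
    jac = lambda z: z - local_grad
    # One inequality constraint per cross-gradient: g_j . z >= 0
    cons = [{"type": "ineq", "fun": lambda z, gj=gj: gj @ z}
            for gj in cross_grads]
    res = minimize(obj, x0=local_grad, jac=jac,
                   constraints=cons, method="SLSQP")
    return res.x

# Toy usage: a 5-dimensional model, one local gradient, two neighbors
rng = np.random.default_rng(0)
g = rng.normal(size=5)                                    # agent's own gradient
neighbor_grads = [rng.normal(size=5) for _ in range(2)]   # cross-gradients
z = project_gradient(g, neighbor_grads)
print(z)  # satisfies z @ gj >= 0 (up to solver tolerance) for each neighbor
```

In a full CGA round, as the abstract describes, each agent would first obtain cross-gradients of its own model computed on its neighbors' local datasets, solve the QP above, and then take an SGD step along the projected direction z.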