Communication-Efficient and Byzantine-Robust Distributed Learning with Error Feedback - Zhuanzhi Papers

August 14, 2021

Communication-Efficient and Byzantine-Robust Distributed Learning with Error Feedback

Avishek Ghosh,Raj Kumar Maity,Swanand Kadhe,Arya Mazumdar,Kannan Ramchandran

We develop a communication-efficient distributed learning algorithm that is robust against Byzantine worker machines. We propose and analyze a distributed gradient-descent algorithm that performs a simple thresholding based on gradient norms to mitigate Byzantine failures. We show that the (statistical) error rate of our algorithm matches that of Yin et al.~\cite{dong}, which uses more complicated schemes (coordinate-wise median, trimmed mean). Furthermore, for communication efficiency, we consider a generic class of $\delta$-approximate compressors from Karimireddy et al.~\cite{errorfeed} that encompasses sign-based compressors and top-$k$ sparsification. Our algorithm uses compressed gradients for aggregation and gradient norms for Byzantine removal. We establish the statistical error rate for non-convex smooth loss functions. We show that, in a certain range of the compression factor $\delta$, the (order-wise) rate of convergence is not affected by the compression operation. Moreover, we analyze the compressed gradient descent algorithm with error feedback (proposed in \cite{errorfeed}) in a distributed setting and in the presence of Byzantine worker machines. We show that exploiting error feedback improves the statistical error rate. Finally, we experimentally validate our results and show good convergence performance for convex (least-squares regression) and non-convex (neural network training) problems.
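
The ingredients named in the abstract (top-$k$ and sign-based compression, norm-based removal of suspicious workers, and error feedback) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the trimming fraction `beta`, the step size, the choice to trim on the norm of the local gradient, and the toy least-squares setup are all assumptions made for illustration. The sketch takes a $\delta$-approximate compressor to be a map $C$ with $\|C(x) - x\|^2 \le (1-\delta)\|x\|^2$, which top-$k$ satisfies with $\delta = k/d$.

```python
import numpy as np

def top_k(v, k):
    """Top-k sparsification: keep the k largest-magnitude entries of v."""
    out = np.zeros_like(v)
    idx = np.argpartition(np.abs(v), -k)[-k:]
    out[idx] = v[idx]
    return out

def scaled_sign(v):
    """Scaled sign compressor: (||v||_1 / d) * sign(v)."""
    return (np.abs(v).sum() / v.size) * np.sign(v)

def norm_trimmed_mean(msgs, norms, beta):
    """Drop the ceil(beta * m) workers with the largest reported norms,
    then average the remaining messages (hypothetical helper)."""
    m = len(msgs)
    keep = m - int(np.ceil(beta * m))
    kept = np.argsort(norms)[:keep]
    return np.mean([msgs[i] for i in kept], axis=0)

def error_feedback_step(grad, memory, lr, compress):
    """Error feedback: compress lr*grad plus the residual left over from
    earlier rounds, and store what the compressor dropped this round."""
    p = lr * grad + memory
    sent = compress(p)
    return sent, p - sent

if __name__ == "__main__":
    # Toy least-squares problem; the first n_byz workers are Byzantine.
    rng = np.random.default_rng(0)
    d, m, n_byz = 20, 10, 2
    w_true = rng.normal(size=d)
    data = []
    for _ in range(m):
        X = rng.normal(size=(50, d))
        data.append((X, X @ w_true))

    w = np.zeros(d)
    memory = [np.zeros(d) for _ in range(m)]
    for _ in range(300):
        msgs, norms = [], []
        for i, (X, y) in enumerate(data):
            grad = X.T @ (X @ w - y) / len(y)
            if i < n_byz:
                # Assumed attack: replace the gradient with large noise.
                grad = 100.0 * rng.normal(size=d)
            sent, memory[i] = error_feedback_step(
                grad, memory[i], 0.1, lambda p: top_k(p, 5))
            msgs.append(sent)
            norms.append(np.linalg.norm(grad))
        # Messages already include the step size, so subtract directly.
        w -= norm_trimmed_mean(msgs, norms, beta=0.2)
    print("distance to w_true:", np.linalg.norm(w - w_true))
```

In this toy run the corrupted vectors have much larger norms than the honest gradients, so sorting by norm is enough to discard them. An attacker that reports small norms is not handled by this sketch.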

