This paper presents fault-tolerant asynchronous Stochastic Gradient Descent (SGD) algorithms. SGD is widely used for approximating the minimum of a cost function $Q$, and is a core component of many optimization and learning algorithms. Our algorithms are designed for the cluster-based model, which combines message-passing and shared-memory communication layers. Processes may fail by crashing, and the algorithm inside each cluster is wait-free, using only reads and writes. For a strongly convex function $Q$, our algorithm tolerates any number of failures and provides a convergence rate that achieves the maximal distributed acceleration over the optimal convergence rate of sequential SGD. For arbitrary functions satisfying standard assumptions on $Q$, the convergence rate includes an additional term that depends on the maximal difference between the parameters held by processes at the same iteration; in this case, the algorithm matches the convergence rate of sequential SGD up to a logarithmic factor. This is achieved by running, at each iteration, a multidimensional approximate agreement algorithm tailored to the cluster-based model. The algorithm for arbitrary functions requires that a majority of the clusters contain at least one nonfaulty process, and we prove that this condition is necessary when optimizing some non-convex functions.
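As a point of reference, the sequential SGD baseline mentioned above performs the standard update sketched below; this sketch is illustrative only, and its notation (the step size $\eta_t$ and the stochastic gradient oracle $\tilde g$) is introduced here rather than taken from the paper:
\[
  x_{t+1} \;=\; x_t \;-\; \eta_t\, \tilde g(x_t),
  \qquad
  \mathbb{E}\big[\tilde g(x_t)\big] \;=\; \nabla Q(x_t).
\]
In the cluster-based setting, each process $i$ maintains its own iterate $x_t^i$, and for arbitrary functions the convergence bound picks up the extra term depending on the per-iteration disagreement $\max_{i,j} \lVert x_t^i - x_t^j \rVert$; the multidimensional approximate agreement step run at each iteration serves to keep this disagreement small.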