Recent advances in deep learning build on growing model sizes and the necessary scaling of compute power. Training such large-scale models requires an intricate combination of data, operator, and pipeline parallelism in complex distributed systems. We show how to use OneFlow's Split, Broadcast, and Partial Sum (SBP) tensor formulations to enable new distributed training methods with asymptotically optimal communication overheads. Using these insights, we develop AutoDDL, a distributed training framework that combines an exhaustive performance model with automated configuration search to find distributions with near-optimal communication overheads. We conduct evaluations on Multi-Node-Single-GPU and Multi-Node-Multi-GPU machines using different models, including VGG and Transformer. Compared to expert-optimized implementations, AutoDDL reduces the end-to-end training time by up to 31.1\% and 10\% for Transformer and up to 17.7\% and 71.5\% for VGG on the two different systems, respectively.
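To make the SBP terminology concrete, the following minimal sketch (not taken from the paper) illustrates OneFlow's split, broadcast, and partial-sum annotations on a two-device placement; the tensor shapes and the two-GPU setup are assumptions chosen purely for illustration.

\begin{verbatim}
# Minimal sketch of OneFlow SBP annotations (illustrative only).
# Run with e.g.: python -m oneflow.distributed.launch --nproc_per_node 2 sbp_sketch.py
import oneflow as flow

P = flow.placement("cuda", ranks=[0, 1])  # two GPUs in one placement group

# Data parallelism: activations split along the batch dim, weights replicated.
x = flow.randn(4, 8).to_global(placement=P, sbp=flow.sbp.split(0))
w = flow.randn(8, 16).to_global(placement=P, sbp=flow.sbp.broadcast)
y = flow.matmul(x, w)        # output keeps sbp = split(0)

# Operator parallelism: splitting both operands along the contraction dim
# yields local partial products whose sbp is partial_sum, i.e. a reduction
# (extra communication) is needed before the next layer consumes them.
x2 = x.to_global(placement=P, sbp=flow.sbp.split(1))
w2 = w.to_global(placement=P, sbp=flow.sbp.split(0))
y2 = flow.matmul(x2, w2)     # sbp = partial_sum
\end{verbatim}

The choice of SBP signature per operator determines the communication pattern, which is the degree of freedom AutoDDL's performance model and configuration search optimize over.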