Model parallelism has become necessary for training large neural networks. However, finding a suitable model-parallel schedule for an arbitrary neural network is a non-trivial task due to the exploding search space. In this work, we present TAP, a model parallelism framework that automatically searches for the best data- and tensor-parallel schedules. Leveraging the key insight that a neural network can be represented as a directed acyclic graph containing only a limited set of frequent subgraphs, we design a graph pruning algorithm that folds the search space efficiently. As a result, TAP runs with sub-linear complexity with respect to the size of the neural network. Experiments show that TAP is $20\times$-$160\times$ faster than the state-of-the-art automatic parallelism framework, and the performance of its discovered schedules is competitive with expert-engineered ones.
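To make the folding idea concrete, below is a minimal, hypothetical sketch (not the TAP implementation): structurally identical blocks of a layer DAG are grouped by a simple signature, so a parallel schedule is searched once per unique group and then replicated, which is what makes the search sub-linear in network depth. The `Block` type and the toy `search_schedule` function are illustrative assumptions.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass(frozen=True)
class Block:
    kind: str     # e.g. "embedding", "attention", "mlp" (hypothetical labels)
    shape: tuple  # parameter shape, used to tell blocks apart structurally

def fold_search_space(blocks):
    """Group structurally identical blocks so each group is searched only once."""
    groups = defaultdict(list)
    for idx, b in enumerate(blocks):
        groups[(b.kind, b.shape)].append(idx)
    return groups

def search_schedule(block):
    """Placeholder for a per-block data/tensor-parallel schedule search."""
    return "tensor_parallel" if block.kind == "attention" else "data_parallel"

if __name__ == "__main__":
    # A toy 24-layer transformer-like network: many repeated subgraphs.
    net = [Block("embedding", (50000, 1024))] + \
          [Block("attention", (1024, 1024)), Block("mlp", (1024, 4096))] * 24
    groups = fold_search_space(net)
    # Only len(groups) searches instead of len(net): 3 instead of 49 here.
    schedules = {sig: search_schedule(net[idxs[0]]) for sig, idxs in groups.items()}
    print(len(net), "blocks folded into", len(groups), "unique groups:", schedules)
```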