Sparse tensors are rapidly becoming critical components of modern deep learning workloads. However, developing high-performance sparse operators can be difficult and tedious, and existing vendor libraries cannot satisfy the escalating demands of new operators. Sparse tensor compilers simplify operator development, but efficient sparse compilation for deep learning remains challenging because a single sparse format cannot maximize hardware efficiency, and single-shot compilers cannot keep up with the latest hardware and system advances. We show that the key to addressing both challenges is two forms of composability. In this paper, we propose SparseTIR, a sparse tensor compilation abstraction that offers composable formats and composable transformations for deep learning workloads. SparseTIR constructs a search space over these composable components for performance tuning. With these improvements, SparseTIR obtains consistent performance speedups over vendor libraries on GPUs for single operators: 1.1-3.3x for GNN operators and 1.1-4.4x for sparse transformer operators. SparseTIR also accelerates end-to-end GNNs by 1.1-2.2x for GraphSAGE training and 0.9-26x for RGCN inference.
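To make the idea of composable formats concrete, the sketch below (a hypothetical illustration in NumPy, not SparseTIR's API) decomposes a sparse matrix's rows into padded ELL-style buckets by nonzero count plus a CSR remainder, and performs SpMM piecewise; a composable-format abstraction lets each such piece be lowered to a kernel suited to its regularity.

```python
# Hypothetical illustration (not SparseTIR's API): decompose a sparse matrix's rows
# into ELL-style buckets by nonzero count, keep long rows in CSR, and run SpMM
# piecewise so regular pieces can use dense, vectorizable kernels.
import numpy as np
import scipy.sparse as sp

def bucketed_spmm(A_csr, B, bucket_widths=(4, 16)):
    """SpMM y = A @ B where rows of A are grouped by nnz into padded ELL buckets."""
    n_rows = A_csr.shape[0]
    nnz_per_row = np.diff(A_csr.indptr)
    y = np.zeros((n_rows, B.shape[1]), dtype=B.dtype)
    assigned = np.zeros(n_rows, dtype=bool)
    for width in bucket_widths:
        rows = np.where((~assigned) & (nnz_per_row <= width))[0]
        assigned[rows] = True
        if rows.size == 0:
            continue
        # Build a padded (len(rows), width) ELL block of column indices and values.
        cols = np.zeros((rows.size, width), dtype=np.int64)
        vals = np.zeros((rows.size, width), dtype=A_csr.dtype)
        for i, r in enumerate(rows):
            start, end = A_csr.indptr[r], A_csr.indptr[r + 1]
            cols[i, : end - start] = A_csr.indices[start:end]
            vals[i, : end - start] = A_csr.data[start:end]
        # Regular-shaped gather + reduction; padded entries have value 0.
        y[rows] = np.einsum("rk,rkd->rd", vals, B[cols])
    # Remaining long rows stay in CSR and use a generic row-wise kernel.
    for r in np.where(~assigned)[0]:
        start, end = A_csr.indptr[r], A_csr.indptr[r + 1]
        y[r] = A_csr.data[start:end] @ B[A_csr.indices[start:end]]
    return y

# Usage: check against SciPy's reference SpMM.
A = sp.random(1000, 1000, density=0.01, format="csr", dtype=np.float32)
B = np.random.rand(1000, 64).astype(np.float32)
assert np.allclose(bucketed_spmm(A, B), A @ B, atol=1e-4)
```

In this sketch the bucket widths play the role of tunable decomposition parameters; a compiler with composable formats and transformations can search over such choices (and over per-piece schedules) rather than committing to a single format for the whole matrix.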