Traditional deep learning compilers rely on heuristics for subgraph generation, which impose extra constraints on graph optimization, e.g., each subgraph can contain at most one complex operator. In this paper, we propose AGO, a graph optimization framework that supports arbitrary subgraph structures to boost the inference performance of deep models by removing such constraints. To create new optimization opportunities for complicated subgraphs, we propose intensive operator fusion, which can effectively stitch multiple complex operators together for better performance. Further, we design a graph partitioning scheme that allows an arbitrary structure within each subgraph while guaranteeing the acyclic property among all generated subgraphs. Additionally, to enable efficient performance tuning on complicated subgraphs, we devise a novel divide-and-conquer tuning mechanism to orchestrate different system components. Through extensive experiments on various neural networks and mobile devices, we show that our system can improve inference performance by up to 3.3x compared with state-of-the-art deep learning compilers.
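The abstract only names the acyclicity guarantee of the partitioning scheme; as a rough illustration of what such a guarantee involves, the Python sketch below checks whether fusing two subgraphs of an operator DAG would introduce a cycle in the subgraph-level graph. The data layout and the function `merge_creates_cycle` are hypothetical assumptions for illustration, not AGO's actual implementation.

```python
# Minimal sketch (NOT AGO's implementation): keeping a partition of an
# operator DAG acyclic when two subgraphs are fused. A fusion of subgraphs
# `a` and `b` is illegal if some path leaves the pair, passes through a
# third subgraph, and re-enters the pair.

def merge_creates_cycle(succ, group_of, a, b):
    """Return True if fusing subgraphs `a` and `b` would make the
    coarsened subgraph-level graph cyclic.

    succ:     dict mapping each operator to its successor operators (a DAG)
    group_of: dict mapping each operator to its current subgraph id
    """
    merged = {a, b}
    # Seed the search with every operator reached by an edge leaving a or b.
    frontier = [v for u, g in group_of.items() if g in merged
                  for v in succ.get(u, []) if group_of[v] not in merged]
    seen = set(frontier)
    while frontier:
        u = frontier.pop()
        if group_of[u] in merged:
            # We re-entered a or b after passing through a third subgraph,
            # so the fused subgraph would sit on a cycle.
            return True
        for v in succ.get(u, []):
            if v not in seen:
                seen.add(v)
                frontier.append(v)
    return False


# Toy example: a diamond op0 -> {op1, op2}, op1 -> op2, with op1 in its own
# subgraph. Fusing op0's and op2's subgraphs would create a cycle through op1.
succ = {"op0": ["op1", "op2"], "op1": ["op2"], "op2": []}
group_of = {"op0": "A", "op1": "C", "op2": "B"}
assert merge_creates_cycle(succ, group_of, "A", "B")
```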