深度学习推理的边缘设备排程优化：基于强化学习的管线Coral边缘TPUs排程 (RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs) - 专知论文

会员服务 ·

0

调度 · 计算图 · 边缘 · 最优 · 深度学习推理 ·

2023 年 4 月 10 日

RESPECT: Reinforcement Learning based Edge Scheduling on Pipelined Coral Edge TPUs

翻译：深度学习推理的边缘设备排程优化：基于强化学习的管线Coral边缘TPUs排程

Jiaqi Yin,Yingjie Li,Daniel Robinson,Cunxi Yu

from arxiv, 6 pages, ACM/IEEE Design Automation Conference (DAC'23)

Deep neural networks (DNNs) have substantial computational and memory requirements, and the compilation of its computational graphs has a great impact on the performance of resource-constrained (e.g., computation, I/O, and memory-bound) edge computing systems. While efficient execution of their computational graph requires an effective scheduling algorithm, generating the optimal scheduling solution is a challenging NP-hard problem. Furthermore, the complexity of scheduling DNN computational graphs will further increase on pipelined multi-core systems considering memory communication cost, as well as the increasing size of DNNs. Using the synthetic graph for the training dataset, this work presents a reinforcement learning (RL) based scheduling framework RESPECT, which learns the behaviors of optimal optimization algorithms and generates near-optimal scheduling results with short solving runtime overhead. Our framework has demonstrated up to $\sim2.5\times$ real-world on-chip inference runtime speedups over the commercial compiler with ten popular ImageNet models deployed on the physical Coral Edge TPUs system. Moreover, compared to the exact optimization methods, the proposed RL scheduling improves the scheduling optimization runtime by up to 683$\times$ speedups compared to the commercial compiler and matches the exact optimal solutions with up to 930$\times$ speedups. Finally, we perform a comprehensive generalizability test, which demonstrates RESPECT successfully imitates optimal solving behaviors from small synthetic graphs to large real-world DNNs computational graphs.

翻译：深度神经网络（DNNs）具有巨大的计算和内存需求，其计算图的编译对资源受限的边缘计算系统的性能产生重大影响（例如计算、I/O和内存）。虽然有效执行其计算图需要一种有效的调度算法，但生成最佳调度解决方案是具有挑战性的NP-hard问题。此外，考虑内存通信成本以及DNN的不断增长的大小，在流水线多核系统上调度DNN计算图的复杂性将进一步增加。本文提出一种基于RL（强化学习）的调度框架RESPECT，使用训练数据集的合成图，学习最优化算法的行为，并生成近乎最优的调度结果，具有较短的求解运行时开销。我们的框架展示出与商业编译器相比高达$~2.5\times$ 的实际芯片推理运行时加速效果，使用10种常见的ImageNet模型部署在Coral Edge TPUs实际系统上。此外，与精确优化方法相比，所提出的RL调度将调度优化运行时提高了高达683倍的速度，比商业编译器匹配的精确最优解提高了高达930倍的速度。最后，我们进行了全面的通用性测试，展示RESPECT成功地将从小的合成图到大的实际DNN计算图的最优求解行

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【Nature. Mach. Intell. 】基于条件transformer、知识蒸馏和强化学习的多约束分子生成

【Nature. Mach. Intell. 】基于条件transformer、知识蒸馏和强化学习的多约束分子生成

专知会员服务

30+阅读 · 2022年3月27日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

图与推荐

0+阅读 · 2022年10月7日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

图神经网络综述：方法及应用 | Deep Reading

图神经网络综述：方法及应用 | Deep Reading

AI100

36+阅读 · 2019年3月17日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

基于优化Schwarz算法的非线性预条件问题

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

基于模型的安全关键的信息物理融合系统的设计方法中的软件综合

国家自然科学基金

1+阅读 · 2014年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

基于混杂Petri网的微电网需求侧能量管理在线优化方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

针对视频分析的高能效异构硬件计算系统研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于ELAD和RNN的电动车用电动机运行效率快速优化关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

云计算环境下的服务动态组合及其优化技术的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于高性能集群计算的围棋机器博弈关键算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

大规模分布式系统实时可预测在线分析研究

国家自然科学基金

1+阅读 · 2008年12月31日

Real-Time Scheduling for Time-Sensitive Networking: A Systematic Review and Experimental Study

Arxiv

0+阅读 · 2023年5月26日

C-MCTS: Safe Planning with Monte Carlo Tree Search

Arxiv

0+阅读 · 2023年5月25日

Gorilla: Large Language Model Connected with Massive APIs

Arxiv

1+阅读 · 2023年5月24日

Optimal Fairness Scheduling for Coded Caching in Multi-AP Wireless Local Area Networks

Arxiv

0+阅读 · 2023年5月24日

Bulk-Switching Memristor-based Compute-In-Memory Module for Deep Neural Network Training

Arxiv

0+阅读 · 2023年5月23日

Full Stack Optimization of Transformer Inference: a Survey

Arxiv

19+阅读 · 2023年2月27日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

A Survey on Transformers in Reinforcement Learning

Arxiv

31+阅读 · 2023年1月8日

Time-Series Event Prediction with Evolutionary State Graph

Arxiv

14+阅读 · 2020年11月25日

A Survey on Edge Computing Systems and Tools

Arxiv

35+阅读 · 2019年11月7日

VIP会员

文章信息

相关主题

深度学习推理

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

【Nature. Mach. Intell. 】基于条件transformer、知识蒸馏和强化学习的多约束分子生成

【Nature. Mach. Intell. 】基于条件transformer、知识蒸馏和强化学习的多约束分子生成

专知会员服务

30+阅读 · 2022年3月27日

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

【MIla】一种意识启发规划的基于模型强化学习，A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning

专知会员服务

23+阅读 · 2022年3月19日

机器学习组合优化

机器学习组合优化

专知会员服务

110+阅读 · 2021年2月16日

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

【牛津大学】深度残差强化学习，Deep Residual Reinforcement Learning

专知会员服务

84+阅读 · 2020年2月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

新质生成式AI赋能产业变革的实践与路径

用于多模态大模型的离散标记化：全面综述

Nature综述：金融网络中的物理学

【CMU博士论文】通信高效且差分隐私的优化方法

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

不可错过！斯坦福《图学习》研讨会，Jure Leskovec主持，附slides！

图与推荐

0+阅读 · 2022年10月7日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Deep Compression/Acceleration：模型压缩加速论文汇总

Deep Compression/Acceleration：模型压缩加速论文汇总

极市平台

14+阅读 · 2019年5月15日

图神经网络综述：方法及应用 | Deep Reading

图神经网络综述：方法及应用 | Deep Reading

AI100

36+阅读 · 2019年3月17日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Real-Time Scheduling for Time-Sensitive Networking: A Systematic Review and Experimental Study

Arxiv

0+阅读 · 2023年5月26日

C-MCTS: Safe Planning with Monte Carlo Tree Search

Arxiv

0+阅读 · 2023年5月25日

Gorilla: Large Language Model Connected with Massive APIs

Arxiv

1+阅读 · 2023年5月24日

Optimal Fairness Scheduling for Coded Caching in Multi-AP Wireless Local Area Networks

Arxiv

0+阅读 · 2023年5月24日

Bulk-Switching Memristor-based Compute-In-Memory Module for Deep Neural Network Training

Arxiv

0+阅读 · 2023年5月23日

Full Stack Optimization of Transformer Inference: a Survey

Arxiv

19+阅读 · 2023年2月27日

A Survey on Causal Reinforcement Learning

Arxiv

29+阅读 · 2023年2月10日

A Survey on Transformers in Reinforcement Learning

Arxiv

31+阅读 · 2023年1月8日

Time-Series Event Prediction with Evolutionary State Graph

Arxiv

14+阅读 · 2020年11月25日

A Survey on Edge Computing Systems and Tools

Arxiv

35+阅读 · 2019年11月7日

相关基金

基于优化Schwarz算法的非线性预条件问题

国家自然科学基金

0+阅读 · 2015年12月31日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

基于模型的安全关键的信息物理融合系统的设计方法中的软件综合

国家自然科学基金

1+阅读 · 2014年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

基于混杂Petri网的微电网需求侧能量管理在线优化方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

针对视频分析的高能效异构硬件计算系统研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于ELAD和RNN的电动车用电动机运行效率快速优化关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

云计算环境下的服务动态组合及其优化技术的研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于高性能集群计算的围棋机器博弈关键算法研究

国家自然科学基金

0+阅读 · 2009年12月31日

大规模分布式系统实时可预测在线分析研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员