Deep neural network (DNN) inference is increasingly executed on mobile and embedded platforms for lower latency and better privacy. However, efficient deployment on these platforms is challenging due to their intensive computation and memory-access demands. We propose a holistic system design for DNN performance and energy optimisation, combining the trade-off opportunities in both algorithms and hardware. The system can be viewed as three abstract layers: the device layer contains heterogeneous computing resources; the application layer runs multiple concurrent workloads; and the runtime resource management layer monitors the dynamically changing performance targets of the algorithms as well as hardware resources and constraints, and tries to meet them by tuning the algorithm and hardware at the same time. Moreover, we illustrate the runtime approach through a dynamic version of the 'once-for-all network' (namely Dynamic-OFA), which can scale the ConvNet architecture to fit heterogeneous computing resources efficiently and generalises well to other model architectures such as Transformers. Compared to state-of-the-art dynamic DNNs, our experimental results using ImageNet on a Jetson Xavier NX show that Dynamic-OFA is up to 3.5x (CPU) and 2.4x (GPU) faster at similar ImageNet Top-1 accuracy, or achieves 3.8% (CPU) and 5.1% (GPU) higher accuracy at similar latency. Furthermore, compared with Linux governors (e.g. performance, schedutil), our runtime approach reduces energy consumption by 16.5% at similar latency.
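The runtime selection step described above can be sketched as follows. This is a minimal, hypothetical illustration (not the paper's implementation): a Dynamic-OFA-style model keeps several sub-networks sampled from one weight-shared supernet, and the runtime manager picks the most accurate sub-network whose measured latency still meets the current target. The sub-network names and latency/accuracy numbers are invented for illustration.

```python
# Hypothetical sketch of the runtime sub-network selection loop.
# All configuration names and numbers below are illustrative, not from the paper.
from dataclasses import dataclass

@dataclass
class SubNet:
    name: str
    latency_ms: float  # measured inference latency on the current device
    top1_acc: float    # ImageNet Top-1 accuracy of this sub-network

def select_subnet(subnets, latency_target_ms):
    """Return the most accurate sub-network that meets the latency target;
    if none does, degrade gracefully to the fastest one."""
    feasible = [s for s in subnets if s.latency_ms <= latency_target_ms]
    if not feasible:
        return min(subnets, key=lambda s: s.latency_ms)
    return max(feasible, key=lambda s: s.top1_acc)

# Illustrative sub-networks sampled from a shared supernet:
subnets = [
    SubNet("small",  12.0, 0.74),
    SubNet("medium", 25.0, 0.78),
    SubNet("large",  48.0, 0.80),
]

print(select_subnet(subnets, 30.0).name)  # -> medium
print(select_subnet(subnets, 5.0).name)   # -> small (no feasible option)
```

In the full system, the same monitoring loop would also adjust hardware knobs (e.g. CPU/GPU frequency via the platform's power-management interface) alongside the sub-network choice.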