Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes - 专知论文

January 25, 2023

Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes

Matteo Risso,Alessio Burrello,Luca Benini,Enrico Macii,Massimo Poncino,Daniele Jahier Pagliari

Quantization is widely employed in both cloud and edge systems to reduce the memory occupation, latency, and energy consumption of deep neural networks. In particular, mixed-precision quantization, i.e., the use of different bit-widths for different portions of the network, has been shown to provide excellent efficiency gains with limited accuracy drops, especially with optimized bit-width assignments determined by automated Neural Architecture Search (NAS) tools. State-of-the-art mixed-precision quantization works layer-wise, i.e., it uses different bit-widths for the weight and activation tensors of each network layer. In this work, we widen the search space, proposing a novel NAS that selects the bit-width of each weight tensor channel independently. This gives the tool the additional flexibility of assigning a higher precision only to the weights associated with the most informative features. Testing on the MLPerf Tiny benchmark suite, we obtain a rich collection of Pareto-optimal models in the accuracy vs. model size and accuracy vs. energy spaces. When deployed on the MPIC RISC-V edge processor, our networks reduce the memory and energy for inference by up to 63% and 27%, respectively, compared to a layer-wise approach, for the same accuracy.
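To make the difference between layer-wise and channel-wise bit-width assignment concrete, the following is a minimal sketch in Python with NumPy. It is not the NAS described in the abstract: the bit-widths are fixed by hand rather than searched, the symmetric uniform quantizer is an assumed stand-in, and the function names `quantize_channel`, `quantize_per_channel`, and `weight_storage_bits` are hypothetical. It only illustrates how giving each output channel its own bit-width changes the weight storage of a layer.

```python
import numpy as np


def quantize_channel(w, bits):
    """Fake-quantize one channel with a symmetric uniform quantizer.

    Assumption for illustration: bits >= 2, so that at least the
    levels {-1, 0, +1} are available.
    """
    if bits < 2:
        raise ValueError("this sketch assumes bits >= 2")
    qmax = 2 ** (bits - 1) - 1           # largest integer level
    max_abs = np.max(np.abs(w))
    if max_abs == 0:
        return w.copy()                  # all-zero channel: nothing to do
    scale = max_abs / qmax               # step size for this channel
    q = np.clip(np.round(w / scale), -qmax, qmax)
    return q * scale                     # back to floats ("fake" quantization)


def quantize_per_channel(weights, bits_per_channel):
    """Quantize a weight tensor of shape (out_channels, ...) channel by channel."""
    if len(bits_per_channel) != weights.shape[0]:
        raise ValueError("need exactly one bit-width per output channel")
    return np.stack(
        [quantize_channel(weights[c], b) for c, b in enumerate(bits_per_channel)]
    )


def weight_storage_bits(weights, bits_per_channel):
    """Total number of bits needed to store the quantized weights."""
    per_channel_elems = weights[0].size
    return sum(b * per_channel_elems for b in bits_per_channel)


# Hypothetical 3x3 convolution with 4 output channels and 3 input channels.
rng = np.random.default_rng(0)
w = rng.normal(size=(4, 3, 3, 3))

layer_wise = [8, 8, 8, 8]      # one bit-width shared by the whole layer
channel_wise = [8, 8, 2, 2]    # hand-picked: high precision for two channels only

w_q = quantize_per_channel(w, channel_wise)
print(weight_storage_bits(w, layer_wise))    # 864 bits
print(weight_storage_bits(w, channel_wise))  # 540 bits
```

In this toy layer the channel-wise assignment stores 540 bits instead of 864. Which channels can tolerate the lower precision without hurting accuracy is exactly what the search in the paper is meant to determine; the assignment above is arbitrary.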
