Deep neural networks (DNNs) have been successfully applied in various fields. A major challenge in deploying DNNs, especially on edge devices, is power consumption, which stems from the large number of multiply-and-accumulate (MAC) operations. To address this challenge, we propose PowerPruning, a novel method for reducing power consumption in digital neural network accelerators by selecting weights that incur less power in MAC operations. In addition, the timing characteristics of the selected weights under all activation transitions are evaluated, and the weight-and-activation combinations that lead to small delays are further selected. Consequently, the maximum delay of the sensitized circuit paths in the MAC units is reduced without modifying the MAC units themselves, which allows flexible scaling of the supply voltage to lower power consumption further. Together with retraining, the proposed method reduces the power consumption of DNNs on hardware by up to 78.3% with only a slight accuracy loss.
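The selection idea above can be illustrated with a minimal sketch: given a (here synthetic) per-value power-cost table for the representable weight values, keep only the lowest-power values and project trained weights onto that allowed set, as a stand-in for the retraining step. The cost table, the 4-bit weight range, and the projection rule are all assumptions for illustration, not the paper's actual procedure.

```python
import numpy as np

# Hypothetical per-value power costs for 4-bit signed weights (assumption:
# in practice these would come from gate-level MAC power simulation;
# here they are synthetic random values for illustration only).
rng = np.random.default_rng(0)
weight_values = np.arange(-8, 8)             # representable 4-bit weight values
power_cost = rng.uniform(0.5, 1.5, size=16)  # relative MAC power per value

def select_low_power_values(values, costs, k):
    """Keep the k weight values with the lowest estimated MAC power."""
    order = np.argsort(costs)
    return np.sort(values[order[:k]])

def project_weights(weights, allowed):
    """Map each trained weight to the nearest allowed (low-power) value."""
    idx = np.abs(weights[..., None] - allowed).argmin(axis=-1)
    return allowed[idx]

allowed = select_low_power_values(weight_values, power_cost, k=8)
w = rng.integers(-8, 8, size=(3, 3))   # stand-in for a trained weight tensor
w_pruned = project_weights(w, allowed)
```

In a real flow, the projection would be interleaved with retraining so the network adapts to the restricted weight set; the timing-driven selection and voltage scaling described in the abstract are separate hardware-level steps not modeled here.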