We present a differentiable joint pruning and quantization (DJPQ) scheme. We frame neural network compression as a joint gradient-based optimization problem, trading off between model pruning and quantization automatically for hardware efficiency. DJPQ incorporates variational information bottleneck based structured pruning and mixed-bit precision quantization into a single differentiable loss function. In contrast to previous works, which consider pruning and quantization separately, our method enables users to find the optimal trade-off between both in a single training procedure. To utilize the method for more efficient hardware inference, we extend DJPQ to integrate structured pruning with power-of-two bit-restricted quantization. We show that DJPQ significantly reduces the number of Bit-Operations (BOPs) for several networks while maintaining the top-1 accuracy of the original floating-point models (e.g., a 53x BOPs reduction for ResNet18 on ImageNet and 43x for MobileNetV2). Our scheme outperforms the conventional two-stage approach, which optimizes pruning and quantization independently, in terms of both accuracy and BOPs. Even when bit-restricted quantization is imposed, DJPQ achieves larger compression ratios and better accuracy than the two-stage approach.
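For intuition, a minimal sketch of such a joint objective (illustrative notation, not the paper's exact formulation) is a task loss plus a differentiable hardware-cost regularizer,
\[
\mathcal{L} \;=\; \mathcal{L}_{\text{task}} \;+\; \beta \sum_{l} \mathrm{BOPs}_l,
\qquad
\mathrm{BOPs}_l \;\approx\; (\alpha_l\, c_{\text{in},l})\,(\alpha_l\, c_{\text{out},l})\, k_l^2\, H_l W_l\; b_{w,l}\, b_{a,l},
\]
where $\alpha_l$ is a differentiable kept-channel fraction from structured pruning of layer $l$, $b_{w,l}$ and $b_{a,l}$ are its learned weight and activation bit-widths, $k_l$ is the kernel size, $H_l \times W_l$ the output resolution, and $\beta$ trades task accuracy against compute cost; the bit-restricted variant additionally constrains the bit-widths to powers of two.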