We consider the post-training quantization problem, which discretizes the weights of pre-trained deep neural networks without re-training the model. We propose multipoint quantization, a quantization method that approximates a full-precision weight vector using a linear combination of multiple vectors of low-bit numbers; this is in contrast to typical quantization methods that approximate each weight using a single low-precision number. Computationally, we construct the multipoint quantization with an efficient greedy selection procedure, and adaptively decide the number of low-precision points for each quantized weight vector based on the error of its output. This allows us to achieve higher precision levels for important weights that greatly influence the outputs, yielding an "effect of mixed precision" without physical mixed-precision implementations (which require specialized hardware accelerators). Empirically, our method can be implemented with common operands, bringing almost no memory and computation overhead. We show that our method outperforms a range of state-of-the-art methods on ImageNet classification and that it generalizes to more challenging tasks such as PASCAL VOC object detection.
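To make the construction concrete, below is a minimal sketch of one way such a greedy multipoint approximation could be implemented. It is an illustration only, not the paper's exact procedure: the function names (`quantize_to_grid`, `multipoint_quantize`) and parameters (`num_bits`, `max_points`, `tol`) are assumptions, and the sketch stops based on a weight-space residual norm as a simple proxy, whereas the method described above decides the number of points from the error of the layer's output.

```python
import numpy as np

def quantize_to_grid(v, num_bits=4):
    """Round each entry of v to the nearest point on a uniform signed
    low-bit grid spanning [-max|v|, max|v|]. (Illustrative choice of grid.)"""
    levels = 2 ** (num_bits - 1) - 1
    max_abs = np.max(np.abs(v))
    scale = max_abs / levels if max_abs > 0 else 1.0
    return np.clip(np.round(v / scale), -levels, levels) * scale

def multipoint_quantize(w, num_bits=4, max_points=4, tol=1e-3):
    """Greedy sketch: approximate w by a linear combination of low-bit
    vectors, w ~ sum_i a_i * q_i, adding points until the residual is small."""
    residual = w.astype(np.float64).copy()
    coeffs, points = [], []
    for _ in range(max_points):
        q = quantize_to_grid(residual, num_bits)   # low-bit candidate vector
        denom = np.dot(q, q)
        if denom == 0.0:
            break
        a = np.dot(residual, q) / denom            # least-squares coefficient
        coeffs.append(a)
        points.append(q)
        residual -= a * q                          # update the residual
        if np.linalg.norm(residual) / (np.linalg.norm(w) + 1e-12) < tol:
            break                                  # adaptive number of points
    return coeffs, points

# Usage: approximate one weight vector with a few low-bit points.
w = np.random.randn(64).astype(np.float32)
coeffs, points = multipoint_quantize(w, num_bits=4, max_points=4)
w_hat = sum(a * q for a, q in zip(coeffs, points))
print(len(points), np.linalg.norm(w - w_hat) / np.linalg.norm(w))
```

Because each point is an ordinary low-bit vector scaled by a full-precision coefficient, the approximation can be evaluated with standard fixed-precision kernels, which is consistent with the claim that no specialized mixed-precision hardware is needed.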