Mixed-precision networks allow a variable bit-width quantization for every layer in the network. A major limitation of existing work is that the bit-width of each layer must be fixed at training time, which leaves little flexibility if the characteristics of the device on which the network is deployed change at runtime. In this work, we propose Bit-Mixer, the first method to train a meta-quantized network in which, at test time, any layer can change its bit-width without affecting the overall network's ability to perform highly accurate inference. To this end, we make two key contributions: (a) Transitional Batch-Norms, and (b) a 3-stage optimization process that is shown to be capable of training such a network. We show that our method yields mixed-precision networks that exhibit the flexibility desirable for on-device deployment without compromising accuracy. Code will be made available.
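To make the central idea concrete, the following is a minimal, hypothetical sketch (not the paper's implementation) of a layer whose weight bit-width can be switched at test time. It keeps one set of normalization statistics per bit-width, loosely mirroring the intuition behind Transitional Batch-Norms that activation statistics shift with precision; all names (`SwitchableQuantLayer`, `quantize`, `set_bits`) are illustrative assumptions.

```python
import numpy as np

def quantize(x, bits):
    """Uniform symmetric quantization to the given bit-width (illustrative)."""
    levels = 2 ** (bits - 1) - 1  # e.g. bits=2 -> grid {-1, 0, 1} * scale
    max_abs = np.max(np.abs(x))
    scale = max_abs / levels if max_abs > 0 else 1.0
    return np.round(x / scale) * scale

class SwitchableQuantLayer:
    """A linear layer whose weight bit-width can change at test time.

    Hypothetical sketch: one set of normalization statistics is kept per
    bit-width, since activation statistics depend on the precision used.
    """
    def __init__(self, in_dim, out_dim, bit_widths=(2, 3, 4)):
        rng = np.random.default_rng(0)
        self.W = rng.standard_normal((in_dim, out_dim)) * 0.1
        # Per-bit-width running statistics (here left at their init values).
        self.stats = {b: {"mean": np.zeros(out_dim), "var": np.ones(out_dim)}
                      for b in bit_widths}
        self.bits = max(bit_widths)  # active bit-width

    def set_bits(self, bits):
        # Switching precision requires no retraining; it just selects
        # the quantization grid and the matching statistics.
        assert bits in self.stats, "bit-width must be one of the trained widths"
        self.bits = bits

    def forward(self, x):
        Wq = quantize(self.W, self.bits)          # quantize at active width
        y = x @ Wq
        s = self.stats[self.bits]                 # precision-specific stats
        return (y - s["mean"]) / np.sqrt(s["var"] + 1e-5)

# Usage: change precision per layer at "deployment time".
layer = SwitchableQuantLayer(8, 4)
x = np.ones((2, 8))
layer.set_bits(2)
out_low = layer.forward(x)
layer.set_bits(4)
out_high = layer.forward(x)
```

In a full network, each layer could be switched independently, which is the deployment flexibility the abstract describes; training such a network so that every bit-width combination stays accurate is what the proposed 3-stage optimization addresses.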