Split computing has recently emerged as a paradigm for deploying DNN-based AI workloads, wherein a DNN model is split into two parts: one executed on a mobile/client device and the other on an edge server (or cloud). Data compression is applied to the intermediate tensor that must be transmitted between the two parts, addressing the challenge of optimizing the rate-accuracy-complexity trade-off. Existing split-computing approaches adopt ML-based data compression, but require that the parameters of either the entire DNN model, or a significant portion of it, be retrained for each compression level. This incurs a high computational and storage burden: training a full DNN model from scratch is computationally demanding, maintaining multiple copies of the DNN parameters increases storage requirements, and switching the full set of weights during inference increases memory-bandwidth demand. In this paper, we present an approach that addresses all of these challenges. It involves the systematic design and training of bottleneck units (simple, low-cost neural networks) that can be inserted at the point of split. Our approach is remarkably lightweight during both training and inference, and achieves excellent rate-distortion performance at a small fraction of the compute and storage overhead of existing methods.
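To make the idea concrete, the following is a minimal PyTorch sketch of what such a bottleneck unit might look like. The class name BottleneckUnit, the use of 1x1 convolutions, and the channel counts are illustrative assumptions, not the exact architecture from the paper.

```python
import torch
import torch.nn as nn

class BottleneckUnit(nn.Module):
    """Hypothetical lightweight bottleneck inserted at the split point.

    The encoder runs on the mobile device and compresses the intermediate
    tensor along the channel dimension; the decoder runs on the edge server
    and restores the original channel count. Only these few layers would be
    trained per compression level; the backbone DNN stays frozen.
    """

    def __init__(self, channels: int, bottleneck_channels: int):
        super().__init__()
        # Device-side encoder: shrink the tensor to be transmitted.
        self.encoder = nn.Conv2d(channels, bottleneck_channels, kernel_size=1)
        # Server-side decoder: restore the backbone's expected shape.
        self.decoder = nn.Conv2d(bottleneck_channels, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        z = self.encoder(x)
        # In practice z would be quantized and entropy-coded before
        # transmission; a straight-through pass stands in for that here.
        return self.decoder(z)

# Illustrative usage with a toy frozen backbone head (hypothetical shapes).
backbone_head = nn.Sequential(nn.Conv2d(3, 64, 3, padding=1), nn.ReLU())
for p in backbone_head.parameters():
    p.requires_grad = False  # the backbone is not retrained
bottleneck = BottleneckUnit(channels=64, bottleneck_channels=8)
y = bottleneck(backbone_head(torch.randn(1, 3, 224, 224)))
```

Under these assumptions, each compression level requires storing and switching only the bottleneck's few parameters rather than a full copy of the DNN weights, which is what keeps the training, storage, and memory-bandwidth costs low.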