Fine-tuning models on edge devices like mobile phones would enable privacy-preserving personalization over sensitive data. However, edge training has historically been limited to relatively small models with simple architectures, because training is both memory- and energy-intensive. We present POET, an algorithm that enables training large neural networks on memory-scarce, battery-operated edge devices. POET jointly optimizes the integrated search space of rematerialization and paging, two techniques for reducing the memory consumption of backpropagation. Given a memory budget and a run-time constraint, we formulate a mixed-integer linear program (MILP) for energy-optimal training. Our approach enables training significantly larger models on embedded devices while reducing energy consumption, without modifying the mathematical correctness of backpropagation. We demonstrate that it is possible to fine-tune both ResNet-18 and BERT within the memory constraints of a Cortex-M class embedded device while outperforming current edge training methods in energy efficiency. POET is an open-source project available at https://github.com/ShishirPatil/poet
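To give intuition for the trade-off POET's MILP searches over, here is a minimal sketch (not POET's actual algorithm) of how rematerialization reduces the peak activation memory of backpropagation. The layer count, per-layer sizes, and checkpoint interval `k` below are hypothetical values chosen for illustration.

```python
# Illustrative sketch only: rematerialization trades extra compute for memory
# by discarding intermediate activations and recomputing them during the
# backward pass. We compare the peak activation memory of standard backprop
# (store everything) against checkpointing every k-th layer.

def peak_memory_full(act_sizes):
    # Standard backprop keeps every layer's activations alive
    # until they are consumed by the backward pass.
    return sum(act_sizes)

def peak_memory_checkpointed(act_sizes, k):
    # Keep only every k-th activation ("checkpoints"); the layers in
    # between are recomputed segment by segment during backward, so the
    # peak is roughly the checkpoints plus one segment's activations.
    checkpoints = sum(act_sizes[i] for i in range(0, len(act_sizes), k))
    segment_peaks = [sum(act_sizes[i:i + k])
                     for i in range(0, len(act_sizes), k)]
    return checkpoints + max(segment_peaks)

# Hypothetical network: 16 layers, 4 MB of activations each.
acts = [4] * 16
print(peak_memory_full(acts))             # 64 (MB)
print(peak_memory_checkpointed(acts, 4))  # 32 (MB)
```

Paging attacks the same peak-memory term differently, by moving tensors to secondary storage instead of recomputing them; POET's contribution is choosing, per tensor, which of the two is cheaper in energy under the device's memory and run-time budgets.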