Over-parameterized deep neural networks (DNNs) achieve high prediction accuracy in many applications. Although effective, the large number of parameters hinders their deployment on resource-limited devices and incurs an outsized environmental impact. Sparse training (keeping a fixed number of nonzero weights in each iteration) can significantly reduce training costs by shrinking the model size. However, existing sparse training methods mainly rely on random or greedy drop-and-grow strategies, which tend to become trapped in local minima and yield low accuracy. In this work, to support explainable sparse training, we propose important-weight Exploitation and coverage Exploration to characterize Dynamic Sparse Training (DST-EE), and provide a quantitative analysis of these two metrics. We further design an acquisition function, provide theoretical guarantees for the proposed method, and clarify its convergence properties. Experimental results show that the sparse models (up to 98\% sparsity) obtained by our method outperform SOTA sparse training methods on a wide variety of deep learning tasks. On VGG-19 / CIFAR-100, ResNet-50 / CIFAR-10, and ResNet-50 / CIFAR-100, our method even achieves higher accuracy than dense models. On ResNet-50 / ImageNet, the proposed method achieves up to an 8.2\% accuracy improvement over SOTA sparse training methods.
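To make the exploitation-plus-exploration idea concrete, the sketch below shows how such an acquisition score could drive the grow step of a generic drop-and-grow update: a gradient-magnitude term (exploitation of important weights) is combined with a count-based coverage bonus (exploration of rarely selected connections). This is a minimal, assumption-laden PyTorch sketch, not the authors' exact DST-EE formulation; the names \texttt{drop\_and\_grow}, \texttt{explore\_coef}, and \texttt{select\_count} are hypothetical.

\begin{verbatim}
import torch


def drop_and_grow(weight: torch.Tensor,
                  grad: torch.Tensor,
                  mask: torch.Tensor,
                  select_count: torch.Tensor,
                  k: int,
                  explore_coef: float = 0.1) -> torch.Tensor:
    """Return an updated binary mask with the same number of nonzeros.

    weight, grad, mask, and select_count share the same shape; grad is the
    dense gradient (also defined on masked positions). k is the number of
    connections dropped and regrown in this update.
    """
    # Drop: remove the k active weights with the smallest magnitude.
    active_mag = torch.where(mask.bool(), weight.abs(),
                             torch.full_like(weight, float("inf")))
    drop_idx = torch.topk(active_mag.flatten(), k, largest=False).indices
    mask = mask.clone().flatten()
    mask[drop_idx] = 0

    # Grow: score inactive positions with exploitation + exploration.
    exploitation = grad.abs().flatten()                 # importance of the connection
    exploration = explore_coef / torch.sqrt(
        1.0 + select_count.float().flatten())           # coverage bonus
    score = exploitation + exploration
    score[mask.bool()] = -float("inf")                  # consider only inactive positions
    grow_idx = torch.topk(score, k, largest=True).indices
    mask[grow_idx] = 1

    # Record which connections have been explored (in-place on the caller's tensor).
    flat_count = select_count.view(-1)
    flat_count[grow_idx] += 1

    return mask.view_as(weight)
\end{verbatim}

In a full sparse-training loop, a mask update of this kind would typically be applied only every few hundred iterations, with masked weights zeroed after each optimizer step so that the number of nonzero weights stays fixed.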