A Gradient Flow Framework For Analyzing Network Pruning - Zhuanzhi Papers


February 1, 2021

A Gradient Flow Framework For Analyzing Network Pruning


Ekdeep Singh Lubana, Robert P. Dick

From arXiv; accepted at ICLR 2021

Recent network pruning methods focus on pruning models early on in training. To estimate the impact of removing a parameter, these methods use importance measures that were originally designed to prune trained models. Despite lacking justification for their use early on in training, such measures result in surprisingly low accuracy loss. To better explain this behavior, we develop a general framework that uses gradient flow to unify state-of-the-art importance measures through the norm of model parameters. We use this framework to determine the relationship between pruning measures and evolution of model parameters, establishing several results related to pruning models early on in training: (i) magnitude-based pruning removes parameters that contribute least to reduction in loss, resulting in models that converge faster than magnitude-agnostic methods; (ii) loss-preservation based pruning preserves first-order model evolution dynamics and is therefore appropriate for pruning minimally trained models; and (iii) gradient-norm based pruning affects second-order model evolution dynamics, such that increasing gradient norm via pruning can produce poorly performing models. We validate our claims on several VGG-13, MobileNet-V1, and ResNet-56 models trained on CIFAR-10/CIFAR-100. Code available at https://github.com/EkdeepSLubana/flowandprune.
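To make the three families of importance measures named in the abstract more concrete, the following is a minimal sketch on a toy quadratic loss. The abstract does not give the exact formulas, so the score definitions here are assumptions based on commonly used simplified forms: |θ| for magnitude, |θ ⊙ g| for loss preservation, and |θ ⊙ Hg| for gradient norm, where g is the gradient and H the Hessian. The function names, the matrix `A`, and the vectors `b` and `theta` are all hypothetical illustration values, not taken from the paper or its repository.

```python
# Toy quadratic loss L(theta) = 0.5 * theta^T A theta - b^T theta.
# A, b, and theta are made-up illustration values, not from the paper.

def matvec(m, v):
    """Multiply a square matrix (list of rows) by a vector."""
    return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in m]

def importance_scores(A, b, theta):
    """Return three per-parameter importance scores (assumed simplified forms)."""
    # Gradient of the quadratic loss: A theta - b.
    grad = [at - b_i for at, b_i in zip(matvec(A, theta), b)]
    # Hessian-gradient product; the Hessian of this quadratic is A itself.
    hess_grad = matvec(A, grad)
    magnitude = [abs(t) for t in theta]                          # assumed: |theta|
    loss_preservation = [abs(t * g) for t, g in zip(theta, grad)]     # assumed: |theta * g|
    gradient_norm = [abs(t * hg) for t, hg in zip(theta, hess_grad)]  # assumed: |theta * Hg|
    return magnitude, loss_preservation, gradient_norm

def prune_mask(scores, num_to_prune):
    """Build a keep-mask that zeroes the num_to_prune lowest-scoring parameters."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    pruned = set(order[:num_to_prune])
    return [0 if i in pruned else 1 for i in range(len(scores))]

A = [[2.0, 0.0, 0.0],
     [0.0, 1.0, 0.0],
     [0.0, 0.0, 4.0]]
b = [1.0, 1.0, 1.0]
theta = [0.5, 3.0, -0.1]

magnitude, loss_pres, grad_norm = importance_scores(A, b, theta)
masks = {
    "magnitude": prune_mask(magnitude, 1),
    "loss_preservation": prune_mask(loss_pres, 1),
    "gradient_norm": prune_mask(grad_norm, 1),
}
for name, mask in masks.items():
    print(name, mask)
```

In this toy setup the measures disagree: the magnitude score removes the parameter with the smallest absolute value, while the two gradient-based scores remove the parameter whose gradient is zero. This is only meant to illustrate that the measures rank parameters differently; it does not reproduce the paper's analysis or experiments.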

