Neural network pruning is a popular technique used to reduce the inference costs of modern, potentially overparameterized, networks. Starting from a pre-trained network, the process is as follows: remove redundant parameters, retrain, and repeat while maintaining the same test accuracy. The result is a model that is a fraction of the size of the original with comparable predictive performance (test accuracy). Here, we reassess and evaluate whether the use of test accuracy alone as the terminating condition is sufficient to ensure that the resulting model performs well across a wide spectrum of "harder" metrics, such as generalization to out-of-distribution data and resilience to noise. Across evaluations on varying architectures and data sets, we find that pruned networks effectively approximate the unpruned model; however, the prune ratio at which pruned networks achieve commensurate performance varies significantly across tasks. These results call into question the extent of \emph{genuine} overparameterization in deep learning and raise concerns about the practicability of deploying pruned networks, specifically in the context of safety-critical systems, unless they are widely evaluated beyond test accuracy to reliably predict their performance. Our code is available at https://github.com/lucaslie/torchprune.
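The prune-retrain loop described above can be made concrete with a short sketch. The following is a minimal, hypothetical illustration using PyTorch's built-in `torch.nn.utils.prune` utilities rather than the torchprune library itself; the helper names (`train_fn`, `eval_fn`, `baseline_acc`, `step`, `tolerance`) and the exact stopping rule are assumptions for illustration, not the paper's procedure.

```python
import torch
import torch.nn.utils.prune as prune

def iterative_prune(model, train_fn, eval_fn, baseline_acc,
                    step=0.2, tolerance=0.005):
    """Illustrative prune-retrain loop (hypothetical helpers):
    remove the smallest-magnitude weights, retrain, and repeat
    while test accuracy stays within `tolerance` of the baseline."""
    # Collect prunable (module, parameter-name) pairs.
    params = [(m, "weight") for m in model.modules()
              if isinstance(m, (torch.nn.Linear, torch.nn.Conv2d))]
    while True:
        # Globally remove a fraction `step` of the remaining weights
        # with the smallest L1 magnitude.
        prune.global_unstructured(
            params, pruning_method=prune.L1Unstructured, amount=step)
        train_fn(model)  # retrain (fine-tune) the pruned network
        # Terminate once test accuracy falls below the baseline.
        if eval_fn(model) < baseline_acc - tolerance:
            return model
```

In practice one would typically also roll back the final pruning step once the accuracy threshold is crossed; the sketch omits this for brevity.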