Wood Fisher: 神经网络压缩的高效二级近端 (WoodFisher: Efficient Second-Order Approximation for Neural Network Compression)

Second-order information, in the form of Hessian- or Inverse-Hessian-vector products, is a fundamental tool for solving optimization problems. Recently, there has been significant interest in utilizing this information in the context of deep neural networks; however, relatively little is known about the quality of existing approximations in this context. Our work examines this question, identifies issues with existing approaches, and proposes a method called WoodFisher to compute a faithful and efficient estimate of the inverse Hessian. Our main application is to neural network compression, where we build on the classic Optimal Brain Damage/Surgeon framework. We demonstrate that WoodFisher significantly outperforms popular state-of-the-art methods for one-shot pruning. Further, even when iterative, gradual pruning is considered, our method results in a gain in test accuracy over the state-of-the-art approaches, for pruning popular neural networks (like ResNet-50, MobileNetV1) trained on standard image classification datasets such as ImageNet ILSVRC. We examine how our method can be extended to take into account first-order information, as well as illustrate its ability to automatically set layer-wise pruning thresholds and perform compression in the limited-data regime. The code is available at the following link, https://github.com/IST-DASLab/WoodFisher.

翻译：第二顺序信息,以Hessian或Invers-Hessian-Victor产品的形式,是解决优化问题的基本工具。最近,人们对在深层神经网络中利用这一信息的兴趣很大;然而,对于这方面的现有近似质量了解相对较少。我们的工作研究这一问题,找出现有方法的问题,并提议一种叫WoodFisher的方法,以忠实和高效地估算逆向赫瑟。我们的主要应用是神经网络压缩,我们在那里建立经典的“最佳脑损伤/外生”框架。我们证明,WoodFisher明显超越了在一线性神经网络中流行的“艺术状态”方法。此外,即使考虑过迭接、逐步的“调整”方法,我们的方法也使得人们能够测试现有“现代”方法的准确性,用于运行大众神经网络(如ResNet-50,移动NetV1), 用于进行神经网络压缩,我们在那里建立典型的图像网络网络网络/外生化框架。我们研究了Wisher Food-Art-destrual-deal-deal-deal-deal-deformal-defornal-de-deal-deal-de-de-de-de-de-deal-destrubal-s),我们的方法是如何可以自动地将数据连接到一个有限的数据序列。我们是如何的系统。我们是如何将数据系统。我们是如何将数据序列的系统-de-de-maismal-de-de-de-de-de-destrutismal-de-de-de-de-sal-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-deal-deal-deal-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-de-

相关内容

Neural Networks

关注 1649

神经网络（Neural Networks）是世界上三个最古老的神经建模学会的档案期刊:国际神经网络学会(INNS)、欧洲神经网络学会(ENNS)和日本神经网络学会(JNNS)。神经网络提供了一个论坛，以发展和培育一个国际社会的学者和实践者感兴趣的所有方面的神经网络和相关方法的计算智能。神经网络欢迎高质量论文的提交，有助于全面的神经网络研究，从行为和大脑建模，学习算法，通过数学和计算分析，系统的工程和技术应用，大量使用神经网络的概念和技术。这一独特而广泛的范围促进了生物和技术研究之间的思想交流，并有助于促进对生物启发的计算智能感兴趣的跨学科社区的发展。因此，神经网络编委会代表的专家领域包括心理学，神经生物学，计算机科学，工程，数学，物理。该杂志发表文章、信件和评论以及给编辑的信件、社论、时事、软件调查和专利信息。文章发表在五个部分之一:认知科学，神经科学，学习系统，数学和计算分析、工程和应用。官网地址：http://dblp.uni-trier.de/db/journals/nn/

【ICML2020】用于图结构化数据的卷积核网络，Convolutional Kernel Networks for Graph-Structured Data

专知会员服务

44+阅读 · 2020年6月29日