There has recently been an increasing interest in evaluating neural networks locally on computationally limited devices, in order to exploit their effectiveness in a growing number of applications; this effectiveness, however, has come together with a considerable increase in the size of modern neural networks, which is a major obstacle in such resource-constrained settings. There has thus been a demand for neural network compression techniques. Several proposals have been made in this direction, notably including hashing-based and pruning-based methods. However, the evaluation of the efficacy of these techniques has so far been heterogeneous, with no clear evidence in favor of any of them over the others. The goal of this work is to address this latter issue by providing a comparative study. While most previous studies test the ability of a technique to reduce the number of parameters of state-of-the-art networks, we follow [CWT+15] in evaluating their performance on basic architectures trained on the MNIST dataset and variants of it, which allows for a clearer analysis of some aspects of their behavior. To the best of our knowledge, we are the first to directly compare well-known approaches such as HashedNet, Optimal Brain Damage (OBD), and magnitude-based pruning with L1 and L2 regularization, both against one another and against equivalent-size feed-forward networks, on simple (fully-connected) and structured (convolutional) architectures. Rather surprisingly, our experiments show that (iterative) pruning-based methods are substantially better than the HashedNet architecture, whose compression does not appear advantageous compared to a carefully chosen convolutional network. We also show that, when the compression level is high, the well-known OBD pruning heuristic deteriorates to the point of being less effective than simple magnitude-based techniques.
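As a minimal sketch of the simplest technique compared above, magnitude-based pruning removes the weights of smallest absolute value; in the iterative variant this step is interleaved with retraining. The numpy-based helper below, with hypothetical names not taken from the paper, is an illustration under these assumptions rather than the authors' implementation.

import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the `sparsity` fraction of weights with smallest magnitude."""
    k = int(sparsity * weights.size)
    if k == 0:
        return weights.copy()
    # The k-th smallest absolute value serves as the pruning threshold.
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    # Weights at or below the threshold are removed (ties may prune slightly more).
    return np.where(np.abs(weights) > threshold, weights, 0.0)

# Example: prune 90% of a random 256x256 weight matrix.
rng = np.random.default_rng(0)
W = rng.normal(size=(256, 256))
W_pruned = magnitude_prune(W, 0.9)
print(f"nonzero weights kept: {np.count_nonzero(W_pruned) / W.size:.1%}")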