体重演变:通过不断演变的深神经网培训,提高深神经网络的强度值 (Weight Evolution: Improving Deep Neural Networks Training through Evolving Inferior Weight Values)

To obtain good performance, convolutional neural networks are usually over-parameterized. This phenomenon has stimulated two interesting topics: pruning the unimportant weights for compression and reactivating the unimportant weights to make full use of network capability. However, current weight reactivation methods usually reactivate the entire filters, which may not be precise enough. Looking back in history, the prosperity of filter pruning is mainly due to its friendliness to hardware implementation, but pruning at a finer structure level, i.e., weight elements, usually leads to better network performance. We study the problem of weight element reactivation in this paper. Motivated by evolution, we select the unimportant filters and update their unimportant elements by combining them with the important elements of important filters, just like gene crossover to produce better offspring, and the proposed method is called weight evolution (WE). WE is mainly composed of four strategies. We propose a global selection strategy and a local selection strategy and combine them to locate the unimportant filters. A forward matching strategy is proposed to find the matched important filters and a crossover strategy is proposed to utilize the important elements of the important filters for updating unimportant filters. WE is plug-in to existing network architectures. Comprehensive experiments show that WE outperforms the other reactivation methods and plug-in training methods with typical convolutional neural networks, especially lightweight networks. Our code is available at https://github.com/BZQLin/Weight-evolution.

翻译：为了取得良好的性能, 进化神经网络通常会过于光滑。这种现象刺激了两个有趣的话题: 调整不重要的重量, 压缩和重新激活不重要的重量, 以充分利用网络能力。然而, 当前重量的恢复方法通常会重新启动整个过滤器, 这可能不够精确。回顾历史, 过滤运行的繁荣主要在于它是否有利于硬件的落实, 但是在一个更细的结构层次上运行, 即重量元素, 通常会导致更好的网络性能。我们研究的是重力元素在本文中重新激活的问题。受进化的驱动, 我们选择了不重要的过滤器, 并通过将其与重要过滤器中的重要元素相结合来更新这些不重要的元素, 就像基因交叉来产生更好的后代, 所建议的方法被称为权重演进( WE)。我们主要由四种战略组成。我们提出一个全球选择战略和本地选择战略, 并把它们合并到一个不重要的过滤器。一个前瞻性匹配战略, 以找到匹配的重要过滤器和交叉的策略。受进化的过滤器, 我们的常规网络中的重要的升级方法, 展示其它的升级方法。