Designing a high-efficiency and high-quality expressive network architecture has always been the most important research topic in the field of deep learning. Most of today's network design strategies focus on how to integrate features extracted from different layers, and how to design computing units to effectively extract these features, thereby enhancing the expressiveness of the network. This paper proposes a new network design strategy, i.e., to design the network architecture based on gradient path analysis. On the whole, most of today's mainstream network design strategies are based on feed forward path, that is, the network architecture is designed based on the data path. In this paper, we hope to enhance the expressive ability of the trained model by improving the network learning ability. Due to the mechanism driving the network parameter learning is the backward propagation algorithm, we design network design strategies based on back propagation path. We propose the gradient path design strategies for the layer-level, the stage-level, and the network-level, and the design strategies are proved to be superior and feasible from theoretical analysis and experiments.
翻译:设计高效和高质量的表达式网络结构一直是深层学习领域最重要的研究课题。今天的网络设计战略大多侧重于如何整合从不同层次提取的特征,以及如何设计计算单位以有效提取这些特征,从而提高网络的清晰度。本文提出了一个新的网络设计战略,即根据梯度路径分析设计网络结构。总体而言,今天的主流网络设计战略大多基于前方路径,即网络结构是根据数据路径设计的。在本文件中,我们希望通过提高网络学习能力,提高经过培训的模式的表达能力。由于驱动网络参数学习的机制是落后的传播算法,我们设计基于后向传播路径的网络设计战略。我们提出了层次、阶段层次和网络层次的梯度设计战略,而设计战略已证明从理论分析和实验来看是优和可行的。