A convolutional neural network can be constructed using numerical methods for solving dynamical systems, since the forward pass of the network can be regarded as a trajectory of a dynamical system. However, existing models based on numerical solvers cannot avoid the iterations of implicit methods, which makes the models inefficient at inference time. In this paper, we reinterpret the pre-activation Residual Networks (ResNets) and their variants from the dynamical systems view. We consider that the iterations of implicit Runge-Kutta methods are fused into the training of these models. Moreover, we propose a novel approach to constructing network models based on high-order Runge-Kutta methods in order to achieve higher efficiency. Our proposed models are referred to as the Runge-Kutta Convolutional Neural Networks (RKCNNs). The RKCNNs are evaluated on multiple benchmark datasets. The experimental results show that RKCNNs are vastly superior to other dynamical system network models: they achieve higher accuracy with much fewer resources. They also expand the family of network models based on numerical methods for dynamical systems.
翻译:可以使用数字方法解决动态系统,因为网络的前方路口可被视为动态系统的轨迹。然而,基于数字求解器的现有模型无法避免隐含方法的迭代,使模型在推论时间效率低下。在本文中,我们重新解释前起动残余网络(ResNets)及其在动态系统视图中的变体。我们认为,隐含龙格-库塔方法的迭代会结合到这些模型的培训中。此外,我们提出了一种新颖的方法来构建基于高阶龙格-库塔方法的网络模型,以便实现更高的效率。我们提议的模型称为Runge-库塔进动神经网络(RKCNNs)。RKCNNs是在多个基准数据集上进行评估的。实验结果表明,RKCNNs与其他动态系统网络模型相比,其高度优于其他动态系统模型:它们以更少的资源实现更高的准确性。它们还扩大了基于动态系统数字方法的网络模型的组合。