Residual deep neural networks (ResNets) are mathematically described as interacting particle systems. In the case of infinitely many layers the ResNet leads to a system of coupled system of ordinary differential equations known as neural differential equations. For large scale input data we derive a mean--field limit and show well--posedness of the resulting description. Further, we analyze the existence of solutions to the training process by using both a controllability and an optimal control point of view. Numerical investigations based on the solution of a formal optimality system illustrate the theoretical findings.
翻译:残余的深神经网络(ResNets)在数学上被描述为互动粒子系统。在无限多的层中,ResNet导致形成一个普通差异方程式的组合系统,称为神经差异方程式。对于大规模输入数据,我们得出一个平均场限,并显示由此产生的描述的准确性。此外,我们通过使用控制性和最佳控制点分析培训过程存在的解决办法。基于正规最佳度系统解决方案的数值调查显示了理论结论。