Implicit neural networks, a.k.a. deep equilibrium networks, are a class of implicit-depth learning models in which function evaluation is performed by solving a fixed-point equation. They generalize classic feedforward models and are equivalent to infinite-depth weight-tied feedforward networks. While implicit models show improved accuracy and significant reduction in memory consumption, they can suffer from ill-posedness and convergence instability. This paper provides a new framework, which we call Non-Euclidean Monotone Operator Network (NEMON), to design well-posed and robust implicit neural networks based upon contraction theory for the non-Euclidean $\ell_{\infty}$ norm. Our framework includes (i) a novel condition for well-posedness based on one-sided Lipschitz constants, (ii) an average iteration for computing fixed points, and (iii) explicit estimates on input-output Lipschitz constants. Additionally, we design a training problem with the well-posedness condition and the average iteration as constraints and, to achieve robust models, with the input-output Lipschitz constant as a regularizer. Our $\ell_{\infty}$ well-posedness condition leads to a larger polytopic training search space than existing conditions, and our average iteration enjoys accelerated convergence. Finally, we evaluate our framework on image classification using the MNIST and CIFAR-10 datasets. Our numerical results demonstrate improved accuracy and robustness of the implicit models with smaller input-output Lipschitz bounds. Code is available at https://github.com/davydovalexander/Non-Euclidean_Mon_Op_Net.
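To make the fixed-point evaluation concrete, below is a minimal NumPy sketch (not the authors' implementation; see the repository above for their code) of the average iteration for a layer of the form $z = \phi(Wz + Ux + b)$ with a ReLU activation. It checks the $\ell_{\infty}$ well-posedness condition $\mu_{\infty}(W) < 1$, where $\mu_{\infty}$ is the $\ell_{\infty}$ matrix measure (a one-sided Lipschitz bound). The names `W`, `U`, `b`, `alpha`, and `tol` are illustrative choices, not part of the paper's API.

```python
import numpy as np

def mu_inf(W):
    """l_inf matrix measure mu_inf(W) = max_i ( W[i,i] + sum_{j != i} |W[i,j]| ),
    a one-sided Lipschitz constant of the map z -> W z."""
    abs_off_diag = np.abs(W) - np.diag(np.abs(np.diag(W)))  # |W| with diagonal zeroed
    return np.max(np.diag(W) + abs_off_diag.sum(axis=1))

def nemon_forward(W, U, b, x, alpha=0.5, tol=1e-6, max_iter=500):
    """Average iteration  z <- (1 - alpha) z + alpha * relu(W z + U x + b).
    Under mu_inf(W) < 1 the fixed point exists and the iteration converges."""
    assert mu_inf(W) < 1.0, "l_inf well-posedness condition mu_inf(W) < 1 violated"
    z = np.zeros(W.shape[0])
    for _ in range(max_iter):
        z_new = (1 - alpha) * z + alpha * np.maximum(W @ z + U @ x + b, 0.0)
        if np.max(np.abs(z_new - z)) < tol:  # stopping criterion in the l_inf norm
            break
        z = z_new
    return z_new
```

At training time, the paper additionally constrains $W$ to satisfy this condition and regularizes the input-output Lipschitz estimate; the sketch above covers only the forward (fixed-point) evaluation.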