Weight-tied models have attracted attention in the modern development of neural networks. The deep equilibrium model (DEQ) represents infinitely deep neural networks with weight-tying, and recent studies have shown the potential of this type of approach. DEQs are needed to iteratively solve root-finding problems in training and are built on the assumption that the underlying dynamics determined by the models converge to a fixed point. In this paper, we present the stable invariant model (SIM), a new class of deep models that in principle approximates DEQs under stability and extends the dynamics to more general ones converging to an invariant set (not restricted in a fixed point). The key ingredient in deriving SIMs is a representation of the dynamics with the spectra of the Koopman and Perron--Frobenius operators. This perspective approximately reveals stable dynamics with DEQs and then derives two variants of SIMs. We also propose an implementation of SIMs that can be learned in the same way as feedforward models. We illustrate the empirical performance of SIMs with experiments and demonstrate that SIMs achieve comparative or superior performance against DEQs in several learning tasks.
翻译:深度平衡模型(DEQ)代表着无限深的神经网络,具有重量限制,而最近的研究表明了这种类型方法的潜力。DEQ需要反复解决培训中的根查问题,并基于以下假设:由模型确定的基本动态会汇合到一个固定点。在本文中,我们介绍了稳定的不变化模型(SIM),这是一种新的深度模型,原则上在稳定性下接近DEQ,并将动态扩大到更一般的聚合成一个变异数据集(不受固定点限制)的模型。生成SIM的关键成分是反映与Koopman和Perron-Frobenius操作员的光谱体的动态。这种观点大致显示与DEQs的稳定动态,然后产生两个SIMs的变体。我们还建议采用与进化模型相同的方式来学习SIMs的经验性工作。我们用实验来说明SIMs在几项任务中取得对比性或优异性业绩,并表明SIMs在与DEQ学习几项任务中取得比较性或优性。