Continuous-depth neural models, where the derivative of the model's hidden state is defined by a neural network, have enabled strong sequential data processing capabilities. However, these models rely on advanced numerical differential equation (DE) solvers, resulting in significant overhead in terms of both computational cost and model complexity. In this paper, we present a new family of models, termed Closed-form Continuous-depth (CfC) networks, that are simple to describe and at least one order of magnitude faster, while exhibiting equally strong modeling abilities compared to their ODE-based counterparts. These models are derived from the analytical closed-form solution of an expressive subset of time-continuous models, thus alleviating the need for complex DE solvers altogether. In our experimental evaluations, we demonstrate that CfC networks outperform advanced recurrent models over a diverse set of time-series prediction tasks, including those with long-term dependencies and irregularly sampled data. We believe our findings open new opportunities to train and deploy rich, continuous neural models in resource-constrained settings, which demand both performance and efficiency.
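To make the contrast with solver-based models concrete, the sketch below implements the kind of gated closed-form state update that CfC cells are built around: the next hidden state is a time-dependent interpolation between two learned targets, so each step costs a fixed number of matrix multiplies regardless of the elapsed time, and no ODE solver is invoked. This is a minimal illustration under assumptions of our own; the single `nn.Linear` heads `f`, `g`, `h` and the `tanh` nonlinearities stand in for the paper's actual parameterization.

```python
import torch
import torch.nn as nn

class CfCCell(nn.Module):
    """Minimal closed-form continuous-depth cell (illustrative sketch).

    Computes a gated interpolation of the form
        x(t) = sigmoid(-f * t) * g + (1 - sigmoid(-f * t)) * h,
    where f, g, h are functions of the input and hidden state.
    The head architecture here is an assumption for illustration,
    not the authors' exact design.
    """

    def __init__(self, input_size: int, hidden_size: int):
        super().__init__()
        in_dim = input_size + hidden_size
        self.f = nn.Linear(in_dim, hidden_size)  # time-constant head (gate speed)
        self.g = nn.Linear(in_dim, hidden_size)  # first target-state head
        self.h = nn.Linear(in_dim, hidden_size)  # second target-state head

    def forward(self, x, hidden, t):
        # t holds the elapsed time per sample, shape (batch, 1); because the
        # gate depends on t explicitly, irregular sampling needs no solver.
        z = torch.cat([x, hidden], dim=-1)
        gate = torch.sigmoid(-self.f(z) * t)
        return gate * torch.tanh(self.g(z)) + (1.0 - gate) * torch.tanh(self.h(z))


# Usage: step through an irregularly sampled sequence in constant time per step.
cell = CfCCell(input_size=4, hidden_size=8)
hidden = torch.zeros(2, 8)
inputs = torch.randn(2, 3, 4)             # (batch, time, features)
gaps = torch.tensor([0.1, 0.5, 1.2])      # uneven inter-sample intervals
for i in range(inputs.shape[1]):
    hidden = cell(inputs[:, i], hidden, gaps[i].expand(2, 1))
```

A solver-based continuous-depth model would instead integrate a learned derivative across each gap, with cost growing in the number of solver steps; the closed-form update above is what removes that dependence.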