The reliable prediction of the temporal behavior of complex systems is required in numerous scientific fields. This strong interest is, however, hindered by modeling issues: often, the governing equations describing the physics of the system under consideration are not accessible or, when known, their solution might require a computational time incompatible with the prediction time constraints. Nowadays, approximating complex systems in a generic functional format and informing it ex nihilo from available observations has become common practice, as illustrated by the enormous amount of scientific work that has appeared in recent years. Numerous successful examples based on deep neural networks are already available, although the generalizability of the models and their margins of guarantee are often overlooked. Here, we consider Long Short-Term Memory neural networks and thoroughly investigate the impact of the training set, and of its structure, on the quality of long-term predictions. Leveraging insights from ergodic theory, we perform a thorough computational analysis to assess the amount of data sufficient to guarantee, a priori, a faithful model of the physical system. We show how an informed design of the training set, based on invariants of the system and the structure of the underlying attractor, significantly improves the resulting models, opening up avenues for research within the context of active learning. We further illustrate the non-trivial effects of memory initialization when relying on memory-capable models. Our findings provide evidence-based good practices on the amount and choice of data required for effective data-driven modeling of any complex dynamical system.