学习动态系统数据驱动建模课程 (Curriculum learning for data-driven modeling of dynamical systems)

The reliable prediction of the temporal behavior of complex systems is key in numerous scientific fields. This strong interest is however hindered by modeling issues: often, the governing equations describing the physics of the system are not accessible or, when known, their solution might require a computational time incompatible with the prediction time constraints. Not surprisingly, approximating complex systems in a generic functional format and informing it ex-nihilo from available observations has become common practice in the age of machine learning, as illustrated by the numerous successful examples based on deep neural networks. However, generalizability of the models, margins of guarantee and the impact of data are often overlooked or examined mainly by relying on prior knowledge of the physics. We tackle these issues from a different viewpoint, by adopting a curriculum learning strategy. In curriculum learning, the dataset is structured such that the training process starts from simple samples towards more complex ones in order to favor convergence and generalization. The concept has been developed and successfully applied in robotics and control of systems. Here, we apply this concept for the learning of complex dynamical systems in a systematic way. First, leveraging insights from the ergodic theory, we assess the amount of data sufficient for a-priori guaranteeing a faithful model of the physical system and thoroughly investigate the impact of the training set and its structure on the quality of long-term predictions. Based on that, we consider entropy as a metric of complexity of the dataset; we show how an informed design of the training set based on the analysis of the entropy significantly improves the resulting models in terms of generalizability, and provide insights on the amount and the choice of data required for an effective data-driven modeling.

翻译：在许多科学领域,对复杂系统的时间行为的可靠预测是许多科学领域的关键所在。然而,这种强烈的兴趣却因模型问题而受阻:描述系统物理的治理方程式往往不易获得,或者当人们知道时,其解决方案可能需要一个与预测时间限制不相符的计算时间。毫不奇怪,以通用功能格式与复杂系统相近,并从现有观测结果中告知它前一号系统,在机器学习的时代,这一概念已成为常见的做法,在深层神经网络的众多成功实例中说明了这一点。然而,模型、保证的边缘和数据的影响往往被忽略或主要通过依赖先前的物理知识来加以研究。我们从不同的角度处理这些问题,采用课程学习战略。在课程学习中,数据集的结构结构如此简单,从简单的样本开始,转向更复杂的系统,以便有利于趋同和对系统的控制。在这里,我们应用这一概念来系统化地学习复杂的动态系统系统。首先,利用模型模型理论的洞察力,我们从不同的角度来处理这些问题,我们从一个不同的角度来评估结果数据的质量结构的准确度,然后为我们彻底的精确的精确的精确的系统测测测测测测数据值。