Recent studies have demonstrated that it is possible to combine machine learning with data assimilation to reconstruct the dynamics of a physical model partially and imperfectly observed. Data assimilation is used to estimate the system state from the observations, while machine learning computes a surrogate model of the dynamical system based on those estimated states. The surrogate model can be defined as an hybrid combination where a physical model based on prior knowledge is enhanced with a statistical model estimated by a neural network. The training of the neural network is typically done offline, once a large enough dataset of model state estimates is available. By contrast, with online approaches the surrogate model is improved each time a new system state estimate is computed. Online approaches naturally fit the sequential framework encountered in geosciences where new observations become available with time. In a recent methodology paper, we have developed a new weak-constraint 4D-Var formulation which can be used to train a neural network for online model error correction. In the present article, we develop a simplified version of that method, in the incremental 4D-Var framework adopted by most operational weather centres. The simplified method is implemented in the ECMWF Object-Oriented Prediction System, with the help of a newly developed Fortran neural network library, and tested with a two-layer two-dimensional quasi geostrophic model. The results confirm that online learning is effective and yields a more accurate model error correction than offline learning. Finally, the simplified method is compatible with future applications to state-of-the-art models such as the ECMWF Integrated Forecasting System.
翻译:最近的研究显示,有可能将机器学习与数据同化结合起来,以重建部分和不完全观测到的物理模型的动态。数据同化用于从观测中估计系统状态,而机器学习则根据这些估计状态计算动态系统的代谢模型。代孕模型可以定义为一种混合组合,即基于先前知识的物理模型通过神经网络估计的统计模型得到加强。神经网络的培训一般是离线进行的,一旦有足够的模型状态估计数数据集到位。相比之下,在每次计算新的系统应用数据时,代孕模型就改进了在线方法。在线方法自然符合地质科学中出现的、有时间进行新观测的顺序框架。在最近的一份方法文件中,我们开发了一个新的弱度-约束性4D-Var模型,可用于培训在线模型错误纠正的神经网络。在本文章中,我们开发了一个简化的方法,在大多数运行的气候中心采用的递增 4D-Var 框架里,每次使用新的系统应用的兼容性应用状态估算性应用数据。在地质科学科学中,在新简化的 ECMFFS-Oor-OLL 上, 测试了一种精确的精确性系统,在最新的系统上,在进行精确的精确的系统上进行了精确的系统上,在进行精确的系统上,在进行精确的系统上进行了精确的学习。