Data assimilation techniques are widely used to predict complex dynamical systems with uncertainties, based on time-series observation data. Error covariance matrices modelling is an important element in data assimilation algorithms which can considerably impact the forecasting accuracy. The estimation of these covariances, which usually relies on empirical assumptions and physical constraints, is often imprecise and computationally expensive especially for systems of large dimension. In this work, we propose a data-driven approach based on long short term memory (LSTM) recurrent neural networks (RNN) to improve both the accuracy and the efficiency of observation covariance specification in data assimilation for dynamical systems. Learning the covariance matrix from observed/simulated time-series data, the proposed approach does not require any knowledge or assumption about prior error distribution, unlike classical posterior tuning methods. We have compared the novel approach with two state-of-the-art covariance tuning algorithms, namely DI01 and D05, first in a Lorenz dynamical system and then in a 2D shallow water twin experiments framework with different covariance parameterization using ensemble assimilation. This novel method shows significant advantages in observation covariance specification, assimilation accuracy and computational efficiency.
翻译:根据时间序列观测数据,数据同化技术被广泛用于预测具有不确定性的复杂动态系统; 错误共变矩阵模型是数据同化算法中的一个重要要素,可以对预测准确性产生重大影响; 这些共变法通常依赖经验假设和物理限制,对这些共变法的估计往往不精确,而且计算费用昂贵,特别是对于大维系统。在这项工作中,我们提议以长期短期内存(LSTM)经常神经网络(RNN)为基础的数据驱动方法,以提高动态系统数据同化数据同化的观测共变规格的准确性和效率。从观测/模拟的时间序列数据中学习常变式矩阵,拟议方法不需要对先前错误分布的任何了解或假设,而不像古典的外表调整方法。我们把新办法与两种最先进的共变法调算算法(即DI01和D05)相比较,首先在Lorenz动态系统中,然后在2D浅水双实验框架中,利用感应感应同化的不同共变度参数化法,这种新方法显示了观测共变法的精确性和同化方法的重大优点。