Practical data assimilation algorithms often contain hyper-parameters, which may arise due to, for instance, the use of certain auxiliary techniques like covariance inflation and localization in an ensemble Kalman filter, the re-parameterization of certain quantities such as model and/or observation error covariance matrices, and so on. Given the richness of the established assimilation algorithms, and the abundance of the approaches through which hyper-parameters are introduced to the assimilation algorithms, one may ask whether it is possible to develop a sound and generic method to efficiently choose various types of (sometimes high-dimensional) hyper-parameters. This work aims to explore a feasible, although likely partial, answer to this question. Our main idea is built upon the notion that a data assimilation algorithm with hyper-parameters can be considered as a parametric mapping that links a set of quantities of interest (e.g., model state variables and/or parameters) to a corresponding set of predicted observations in the observation space. As such, the choice of hyper-parameters can be recast as a parameter estimation problem, in which our objective is to tune the hyper-parameters in such a way that the resulted predicted observations can match the real observations to a good extent. From this perspective, we propose a hyper-parameter estimation workflow and investigate the performance of this workflow in an ensemble Kalman filter. In a series of experiments, we observe that the proposed workflow works efficiently even in the presence of a relatively large amount (up to $10^3$) of hyper-parameters, and exhibits reasonably good and consistent performance under various conditions.
翻译:实际数据同化算法通常包含超参数,这可能是由于使用某些辅助技术,例如全方位卡尔曼过滤器中的共差通胀和本地化等技术,对模型和/或观测误差共差矩阵等某些数量进行重新校准,等等。鉴于既定同化算法的丰富性,以及将超参数引入同化算法的方法的丰富性,人们可能会问是否有可能制定一种合理和通用的方法,以便有效地选择各种类型的(有时是高维)超参数。这项工作的目的是探索一种可行(尽管可能是局部的)对该问题的答案。我们的主要想法是基于这样一种概念,即数据同化算法与超参数差差差差矩阵等某些数量可以被视为一种参数性的映射图,将一组利益(例如,模型状态变量和/或参数)与一套相应的观测空间预测值联系起来。因此,各种超参数的选择可以被重新定位为一种参数估测度(有时是高度的)超参数,甚至可能部分地回答这个问题。我们的目标在于从一个长期的轨道观察到一个长期的轨道水平,我们从一个预测的轨道上,从一个预测到一个长期的轨道上,从一个持续的轨道,从一个预测到一个预测到一个持续的轨道上,从一个方向,从一个我们测测测测测测算出一个。