具有噪音和昂贵可能性的模型群集引引法 (Ensemble Inference Methods for Models With Noisy and Expensive Likelihoods)

The increasing availability of data presents an opportunity to calibrate unknown parameters which appear in complex models of phenomena in the biomedical, physical and social sciences. However, model complexity often leads to parameter-to-data maps which are expensive to evaluate and are only available through noisy approximations. This paper is concerned with the use of interacting particle systems for the solution of the resulting inverse problems for parameters. Of particular interest is the case where the available forward model evaluations are subject to rapid fluctuations, in parameter space, superimposed on the smoothly varying large scale parametric structure of interest. {A motivating example from climate science is presented, and ensemble Kalman methods (which do not use the derivative of the parameter-to-data map) are shown, empirically, to perform well. Multiscale analysis is then used to analyze the behaviour of interacting particle system algorithms when rapid fluctuations, which we refer to as noise, pollute the large scale parametric dependence of the parameter-to-data map. Ensemble Kalman methods and Langevin-based methods} (the latter use the derivative of the parameter-to-data map) are compared in this light. The ensemble Kalman methods are shown to behave favourably in the presence of noise in the parameter-to-data map, whereas Langevin methods are adversely affected. On the other hand, Langevin methods have the correct equilibrium distribution in the setting of noise-free forward models, whilst ensemble Kalman methods only provide an uncontrolled approximation, except in the linear case. Therefore a new class of algorithms, ensemble Gaussian process samplers, which combine the benefits of both ensemble Kalman and Langevin methods, are introduced and shown to perform favourably.

翻译：数据提供量的增加为校正生物医学、物理科学和社会科学中复杂的现象模型中出现的未知参数提供了机会。然而,模型的复杂性往往导致使用非常昂贵的参数到数据地图,评估费用昂贵,只能通过杂音近似来获取。本文涉及使用互动粒子系统来解决由此产生的参数问题。特别令人感兴趣的是,在参数空间中,现有的远期模型评价受到快速波动的影响,并被高频不同规模的大型参数参数参数性结构所取代。 {从气候科学中展示了一个激励性实例,并且用混合的卡尔曼方法(不使用参数到数据图的衍生物)来进行评估,这些方法在实验中显示,用共同的参数到数据,然后用多尺度分析在快速波动时使用互动的粒子系统算法的行为。我们称之为噪音,通过对参数到数据地图的大规模偏差依赖度进行污染。Entsemble Kalman方法和Langevin法 {(后者使用参数到数据直径图的衍生物)在这种光线性模型中进行对比。在数值到正值上显示的参数到正值的方法,而参数显示, 方向方法则显示, 方向显示, 方向显示, 方向的方法则显示, 方向显示, 方向法的方法显示, 方向法则显示, 方向显示, 方向显示, 方向方向方向方向显示方向方向方向方向方向显示方向显示方向显示方向方向方向方向方向显示显示显示方向显示方向方向方向方向方向显示显示显示方向方向方向方向方法显示显示方向方向显示显示方向方向方法显示显示方向显示显示显示方向方向方向方向方法方法显示显示显示方法显示显示显示显示方法方法显示方向方向显示显示方向方向显示方向方向显示显示显示显示显示显示显示方向方向显示显示显示显示方法方向方向显示方向方向显示显示方向方向方向显示显示方向方向方向方向显示显示