Bayesian posterior distributions arising in modern applications, including inverse problems in partial differential equation models in tomography and subsurface flow, are often computationally intractable due to the large computational cost of evaluating the data likelihood. To alleviate this problem, we consider using Gaussian process regression to build a surrogate model for the likelihood, resulting in an approximate posterior distribution that is amenable to computations in practice. This work serves as an introduction to Gaussian process regression, in particular in the context of building surrogate models for inverse problems, and presents new insights into a suitable choice of training points. We show that the error between the true and approximate posterior distribution can be bounded by the error between the true and approximate likelihood, measured in the $L^2$-norm weighted by the true posterior, and that efficiently bounding the error between the true and approximate likelihood in this norm suggests choosing the training points in the Gaussian process surrogate model based on the true posterior.
翻译:现代应用中产生的Bayesian 后方分布物,包括断层摄影和地表下流部分等式模型的反向问题,由于评估数据可能性的计算成本巨大,往往在计算上难以解决。为了减轻这一问题,我们考虑使用高斯进程回归法来建立一种可能性替代模型,从而形成一种可以实际计算出的一种近似后方分布物。这项工作是高斯进程回归的引言,特别是在为反向问题建立代用模型方面,并为适当选择培训点提供新的见解。我们表明,真实和近似后方分布物分布物之间的错误可以受真实和近似可能性之间误差的束缚,以按真实的后方计算值加权的2美元-诺姆计算,并且有效地将本规范中真实和近似可能性之间的误差捆绑起来,从而建议根据真实的后台选择高斯进程模型的培训点。