These notes are designed with the aim of providing a clear and concise introduction to the subjects of Inverse Problems and Data Assimilation, and their inter-relations, together with citations to some relevant literature in this area. The first half of the notes is dedicated to studying the Bayesian framework for inverse problems. Techniques such as importance sampling and Markov Chain Monte Carlo (MCMC) methods are introduced; these methods have the desirable property that in the limit of an infinite number of samples they reproduce the full posterior distribution. Since it is often computationally intensive to implement these methods, especially in high dimensional problems, approximate techniques such as approximating the posterior by a Dirac or a Gaussian distribution are discussed. The second half of the notes cover data assimilation. This refers to a particular class of inverse problems in which the unknown parameter is the initial condition of a dynamical system, and in the stochastic dynamics case the subsequent states of the system, and the data comprises partial and noisy observations of that (possibly stochastic) dynamical system. We will also demonstrate that methods developed in data assimilation may be employed to study generic inverse problems, by introducing an artificial time to generate a sequence of probability measures interpolating from the prior to the posterior. The third and final part of the notes describes various topics which blend the theory of inverse problems, data assimilation, and machine learning. Whilst ideas from machine learning appear in the first two parts of the notes, the final part overviews the main ways in which machine learning is impacting on, and has the potential to impact on, both the subjects of inverse problems and data assimilation.
翻译:这些笔记旨在明确和简洁地介绍反向问题和数据同化主题及其相互关系,并引证这一领域的一些相关文献。前半笔记致力于研究巴伊西亚框架的反向问题。引入了重要取样和Markov链链蒙特卡洛(MCMC)方法等技术;这些方法具有可取的属性,在无限数量的样本中,它们复制了完整的后部分布。由于执行这些方法,特别是在高维问题中,往往需要大量计算,因此,大约的技术,例如接近Dirac或Gaussian分布的后部影响部分。前半笔记致力于研究巴伊西亚框架的反面问题。这是指一个特殊的反面问题,其中未知参数是动态系统的初始条件,而在随机动态动态的动态中,数据包括部分的局部和杂乱的观察,因此,从机器对后部影响部分的后期分析中,我们还将表明,从机器对后部理论的理论学方法,到先期数据序列中,通过对数字的循环学学,从中间的理论,从模型学到先导论中,从数字的理论学到先序中,从模型学了各种的理论,从数字学,从中间的理论学学学学学到最后分数序学。