Gaussian process regression is a widely-applied method for function approximation and uncertainty quantification. The technique has gained popularity recently in the machine learning community due to its robustness and interpretability. The mathematical methods we discuss in this paper are an extension of the Gaussian-process framework. We are proposing advanced kernel designs that only allow for functions with certain desirable characteristics to be elements of the reproducing kernel Hilbert space (RKHS) that underlies all kernel methods and serves as the sample space for Gaussian process regression. These desirable characteristics reflect the underlying physics; two obvious examples are symmetry and periodicity constraints. In addition, non-stationary kernel designs can be defined in the same framework to yield flexible multi-task Gaussian processes. We will show the impact of advanced kernel designs on Gaussian processes using several synthetic and two scientific data sets. The results show that including domain knowledge, communicated through advanced kernel designs, has a significant impact on the accuracy and relevance of the function approximation.
翻译:Gausian 进程回归是功能近似和不确定性量化的一种广泛应用的方法。 这种技术由于其坚固性和可解释性,最近在机器学习界越来越受欢迎。 我们在本文件中讨论的数学方法是Gaussian过程框架的延伸。 我们提出的高级内核设计只允许具有某些理想特性的功能成为Hilbert 空间(RKHS)的元素,这些功能是所有内核方法的基础,并作为Gaussian 过程回归的样本空间。这些理想特征反映了基础物理学;两个明显的例子有对称性和周期性限制。此外,非静止内核设计可以在同一个框架内界定,以产生灵活的多任务高斯进程。我们将用几个合成和两个科学数据集显示高级内核设计对高斯进程的影响。结果显示,包括通过高级内核设计传播的域知识,对功能近似的准确性和相关性有重大影响。