We generalize fast Gaussian process leave-one-out formulae to multiple-fold cross-validation, highlighting in turn in broad settings the covariance structure of cross-validation residuals. The employed approach, that relies on block matrix inversion via Schur complements, is applied to both Simple and Universal Kriging frameworks. We illustrate how resulting covariances affect model diagnostics and how to properly transform residuals in the first place. Beyond that, we examine how accounting for dependency between such residuals affect cross-validation-based estimation of the scale parameter. It is found in two distinct cases, namely in scale estimation and in broader covariance parameter estimation via pseudo-likelihood, that correcting for covariances between cross-validation residuals leads back to maximum likelihood estimation or to an original variation thereof. The proposed fast calculation of Gaussian Process multiple-fold cross-validation residuals is implemented and benchmarked against a naive implementation, all in R language. Numerical experiments highlight the accuracy of our approach as well as the substantial speed-ups that it enables. It is noticeable however, as supported by a discussion on the main drivers of computational costs and by a dedicated numerical benchmark, that speed-ups steeply decline as the number of folds (say, all sharing the same size) decreases. Overall, our results enable fast multiple-fold cross-validation, have direct consequences in GP model diagnostics, and pave the way to future work on hyperparameter fitting as well as on the promising field of goal-oriented fold design.
翻译:我们将快速高斯进程一对一的计算公式普遍化为多倍交叉校准公式,在大环境下又强调交叉校准剩余值的共差结构。采用的方法,依靠Schur补充的区块矩阵反转,适用于简单和通用克里吉框架。我们说明由此产生的共差如何影响模型诊断,以及如何首先适当地转换剩余值。此外,我们研究这些剩余值之间的依赖性会计如何影响基于交叉校准的比值估计比例参数。在两个不同的情况中发现,即规模估计和通过假冒相似性进行更广泛的共差参数估计,对交叉校验剩余值之间的共差进行校正后回溯至最大可能性估计或原始变化。拟议高斯进程多倍交叉校验剩余值的快速计算如何执行模型,所有R语种。数值实验强调我们方法的准确性以及它能够促成的大幅加速度。然而,对跨值估算值估算值和更广泛的共变差参数估算,纠正跨比值之间的共差结构值的比较,其明显性结果是,作为核心估算结果的快速计算方法,作为核心计算,作为核心估算结果的快速计算方法,在持续计算,在最终计算中,在持续计算中,在持续计算中,以最终估算的进度计算中,在持续计算中,在持续计算中,以核心计算,在持续进行。