Low-rank approximation is a popular strategy to tackle the "big n problem" associated with large-scale Gaussian process regressions. Basis functions for developing low-rank structures are crucial and should be carefully specified. Predictive processes simplify the problem by inducing basis functions with a covariance function and a set of knots. The existing literature suggests certain practical implementations of knot selection and covariance estimation; however, theoretical foundations explaining the influence of these two factors on predictive processes are lacking. In this paper, the asymptotic prediction performance of the predictive process and Gaussian process predictions is derived and the impacts of the selected knots and estimated covariance are studied. We suggest the use of support points as knots, which best represent data locations. Extensive simulation studies demonstrate the superiority of support points and verify our theoretical results. Real data of precipitation and ozone are used as examples, and the efficiency of our method over other widely used low-rank approximation methods is verified.
翻译:低调近似值是解决与大规模高斯进程回归有关的“大问题”的流行战略,发展低级结构的基础功能至关重要,应当仔细规定。预测过程通过引导基础功能来简化问题,带有共差功能和一组结节。现有文献表明,结结选和共差估计的某些实际执行情况;然而,缺乏解释这两个因素对预测过程的影响的理论基础。在本文中,预测过程和高斯进程预测的无药可治性预测性业绩以及选定结节和估计共差的影响得到了研究。我们建议使用支持点作为结点,这最能代表数据位置。广泛的模拟研究显示了支持点的优势并核实了我们的理论结果。降水和臭氧的实际数据被作为实例使用,并核实了我们方法相对于其他广泛使用的低级近似方法的效率。