With the advent of massive data sets much of the computational science and engineering communities have been moving toward data-driven approaches such as regression and classification. However, they present a significant challenge due to the increasing size, complexity and dimensionality of the problems. In this paper a multilevel Kriging method that scales well with the number of observations and dimensions is developed. A multilevel basis is constructed that is adapted to a kD-tree partitioning of the observations. Numerically unstable covariance matrices with large condition numbers are transformed into well conditioned multilevel matrices without compromising accuracy. Moreover, it is shown that the multilevel prediction $exactly$ solves the Best Linear Unbiased Predictor (BLUP), but is numerically stable. The multilevel method is tested on numerically unstable problems of up 25 dimensions. Numerical results show speedups of up to 42,050 for solving the BLUP problem but to the same accuracy than the traditional iterative approach.
翻译:随着大量数据组的出现,许多计算科学和工程界一直在向回归和分类等数据驱动方法迈进,然而,由于问题的规模、复杂性和维度不断增加,它们构成重大挑战。在本文件中,一种多层次的克里格方法与观测数量和尺寸的开发相匹配。构建了一个多层次的基础,以适应观测的 kD 树分隔。具有大量条件的数值不稳定的共变矩阵被转化成条件良好的多级矩阵,而不会损害准确性。此外,还表明,多层次的预测用美元解决了最佳线性不偏向预测器(BLUP),但数字上是稳定的。多层次方法在25个尺寸的数值不稳定问题上进行了测试。数字性结果显示高达42 050美元的加速速度,用于解决BLUP问题,但与传统的迭接方法的精确性相同。