Gaussian process regression (GPR) is a fundamental model used in machine learning. Owing to its accurate prediction with uncertainty and versatility in handling various data structures via kernels, GPR has been successfully used in various applications. However, in GPR, how the features of an input contribute to its prediction cannot be interpreted. Herein, we propose GPR with local explanation, which reveals the feature contributions to the prediction of each sample, while maintaining the predictive performance of GPR. In the proposed model, both the prediction and explanation for each sample are performed using an easy-to-interpret locally linear model. The weight vector of the locally linear model is assumed to be generated from multivariate Gaussian process priors. The hyperparameters of the proposed models are estimated by maximizing the marginal likelihood. For a new test sample, the proposed model can predict the values of its target variable and weight vector, as well as their uncertainties, in a closed form. Experimental results on various benchmark datasets verify that the proposed model can achieve predictive performance comparable to those of GPR and superior to that of existing interpretable models, and can achieve higher interpretability than them, both quantitatively and qualitatively.
翻译:Gausian 进程回归(GPR)是机器学习中使用的基本模型。由于对通过内核处理各种数据结构的准确预测具有不确定性和多功能性,GPR被成功地应用于各种应用中。然而,在GPR中,投入的特点如何有助于预测,无法解释。在这里,我们建议GPR加上本地解释,其中显示对每个样本预测的特征贡献,同时保持GPR的预测性能。在拟议的模型中,每个样本的预测和解释都使用易于解释的本地线性模型进行。当地线性模型的重量矢量假设来自多种变式高斯进程前期。拟议模型的超分数通过最大限度地增加边际可能性来估计。对于新的测试样本,拟议的模型可以以封闭的形式预测其目标变量和重量矢量的值及其不确定性。各种基准数据集的实验结果证实,拟议的模型能够实现与GPR的预测性业绩相比,并且比现有可解释模型的高级性,并且能够实现更高的质量和可解释性。