Latent Gaussian models and boosting are widely used machine learning techniques. Tree-boosting shows excellent predictive accuracy on many data sets, but potential drawbacks are that it assumes conditional independence of samples, produces discontinuous predictions for, and it can have difficulty with high-cardinality categorical variables. Latent Gaussian models, such as Gaussian process and grouped random effects models, are flexible prior models that allow for making probabilistic predictions. However, existing latent Gaussian models usually assume either a zero or a linear prior mean function. This article introduces a novel approach that combines boosting and latent Gaussian models to remedy the above-mentioned drawbacks and to leverage the advantages of both techniques. We obtain increased predictive accuracy compared to existing approaches in both simulated and real-world data experiments.
翻译:原始高斯模型和助推模型是广泛使用的机器学习技术。 植树催生模型在许多数据集中显示出极好的预测性准确性,但潜在的缺点是,它假定样品具有有条件的独立性,对高斯人绝对变量作出不连续的预测,而且可能难以找到高斯人绝对变量。 延高斯人模型,如高斯进程和组合随机效应模型,是灵活的先期模型,可以进行概率预测。 但是,现有的潜伏高斯人模型通常具有零或直线的前中值功能。 本条引入了一种新颖的方法,将振动模型和潜伏高斯人模型结合起来,以补救上述退步,并利用这两种技术的优势。 与模拟和现实世界数据实验中的现有方法相比,我们获得了更高的预测性准确性。