Gaussian 模型促进 (Latent Gaussian Model Boosting)

Latent Gaussian models and boosting are widely used techniques in statistics and machine learning. Tree-boosting shows excellent predictive accuracy on many data sets, but potential drawbacks are that it assumes conditional independence of samples, produces discontinuous predictions for, e.g., spatial data, and it can have difficulty with high-cardinality categorical variables. Latent Gaussian models, such as Gaussian process and grouped random effects models, are flexible prior models that allow for making probabilistic predictions. However, existing latent Gaussian models usually assume either a zero or a linear prior mean function which can be an unrealistic assumption. This article introduces a novel approach that combines boosting and latent Gaussian models in order to remedy the above-mentioned drawbacks and to leverage the advantages of both techniques. We obtain increased predictive accuracy compared to existing approaches in both simulated and real-world data experiments.

翻译：原始高斯模型和推进是统计和机器学习中广泛使用的技术。植树催生显示许多数据集的预测准确性极强,但潜在的缺点是,它假定样品有条件独立,对空间数据等数据作出不连续的预测,而且它可能难以应付高心绝对变量。延迟高斯模型,如高斯进程和组合随机效应模型,是灵活的先期模型,可以进行概率预测。但是,现有的潜伏高斯模型通常假定零或线性前中值函数,这可能是不现实的假设。文章介绍了一种新颖的方法,将推进模型和潜伏高斯模型结合起来,以纠正上述缺陷,并利用这两种技术的优势。与模拟和现实世界数据实验中的现有方法相比,我们获得了更高的预测准确性。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/