Boosting Trees are one of the most successful statistical learning approaches that involve sequentially growing an ensemble of simple regression trees (i.e., "weak learners"). However, gradient boosted trees are not yet available for spatially correlated data. This paper proposes a new gradient Boosted Trees algorithm for Spatial Data (Boost-S) with covariate information. Boost-S integrates the spatial correlation structure into the classical framework of gradient boosted trees. Each tree is grown by solving a regularized optimization problem, where the objective function involves two penalty terms on tree complexity and takes into account the underlying spatial correlation. A computationally-efficient algorithm is proposed to obtain the ensemble trees. The proposed Boost-S is applied to the spatially-correlated FDG-PET (fluorodeoxyglucose-positron emission tomography) imaging data collected during cancer chemoradiotherapy. Our numerical investigations successfully demonstrate the advantages of the proposed Boost-S over existing approaches for this particular application.
翻译:扶植树是最成功的统计学习方法之一,它涉及相继种植一系列简单的回归树(即“弱学习者”)。然而,对于空间相关数据来说,梯度增殖树还没有可用。本文件提出一个新的具有共变信息的空间数据梯度增殖树算法(Boost-S) 。促进-S 将空间相关结构纳入梯度增生树的经典框架。每棵树都是通过解决常规化优化问题而生长的,其中目标功能涉及对树复杂性的两个惩罚条件,并考虑到潜在的空间相关关系。提议采用计算效率算法来获取共振树。拟议的引力-S 应用于在癌症化学疗法期间收集的空间相关FDG-PET(含脱氧甘糖-粒子排放图解析)成像数据。我们的数字调查成功地展示了拟议波纹-S相对于这一特定应用的现有方法的优势。