The use of cumulative incidence functions for characterizing the risk of one type of event in the presence of others has become increasingly popular over the past decade. The problems of modeling, estimation and inference have been treated using parametric, nonparametric and semi-parametric methods. Efforts to develop suitable extensions of machine learning methods, such as regression trees and related ensemble methods, have begun comparatively recently. In this paper, we propose a novel approach to estimating cumulative incidence curves in a competing risks setting using regression trees and associated ensemble estimators. The proposed methods employ augmented estimators of the Brier score risk as the primary basis for building and pruning trees, and lead to methods that are easily implemented using existing R packages. Data from the Radiation Therapy Oncology Group (trial 9410) is used to illustrate these new methods.
翻译:在过去十年中,利用累积发生率函数来说明某一类事件在其他人在场的情况下的风险越来越普遍,模型、估计和推论的问题已经用参数、非参数和半参数方法处理,开发适当的机器学习方法扩展,例如回归树和相关组合方法的努力最近才开始,在本文件中,我们提议采用一种新颖的办法,在使用回归树和相关共同估计器的相互竞争的风险设置中估计累积发生率曲线。提议的方法采用强化的Brier评分器作为建造和砍伐树木的主要基础,并导致采用易于使用现有R包的方法。使用辐射治疗肿瘤小组的数据(第9410号审判)来说明这些新方法。