When modelling competing risks survival data, several techniques have been proposed in both the statistical and machine learning literature. State-of-the-art methods have extended classical approaches with more flexible assumptions that can improve predictive performance, allow high dimensional data and missing values, among others. Despite this, modern approaches have not been widely employed in applied settings. This article aims to aid the uptake of such methods by providing a condensed compendium of competing risks survival methods with a unified notation and interpretation across approaches. We highlight available software and, when possible, demonstrate their usage via reproducible R vignettes. Moreover, we discuss two major concerns that can affect benchmark studies in this context: the choice of performance metrics and reproducibility.
翻译:在模拟相竞风险生存数据时,在统计和机器学习文献中都提出了几种技术:最先进的方法扩展了传统方法,采用了更灵活的假设,可以提高预测性能,允许提供高维度数据和缺失值等;尽管如此,在应用环境中尚未广泛采用现代方法;本条款的目的是通过提供经过统一标记和解释的竞争性生存方法简编,帮助采用这些方法;我们强调现有软件,并在可能情况下通过可复制的Rignettes展示其使用情况;此外,我们讨论了在这方面可影响基准研究的两个主要问题:选择性能衡量标准和可复制性。