Semi-competing risks data arise when both non-terminal and terminal events are considered in a model. Such data with multiple events of interest are frequently encountered in medical research and clinical trials. In this framework, terminal event can censor the non-terminal event but not vice versa. It is known that variable selection is practical in identifying significant risk factors in high-dimensional data. While some recent works on penalized variable selection deal with these competing risks separately without incorporating possible correlation between them, we perform variable selection in an illness-death model using shared frailty where semiparametric hazard regression models are used to model the effect of covariates. We propose a broken adaptive ridge (BAR) penalty to encourage sparsity and conduct extensive simulation studies to compare its performance with other popular methods. We perform variable selection in an event specific manner so that the potential risk factors and covariates effects can be estimated and selected, simultaneously corresponding to each event in the study. The grouping effect, as well as the oracle property of the proposed BAR procedure are investigated using simulation studies. The proposed method is then applied to real-life data arising from a Colon Cancer study.
翻译:在模型中考虑非终点和终点事件时,会产生半竞争风险数据。在医学研究和临床试验中,经常会遇到这种具有多重关注事件的数据。在这个框架内,终点事件可以审查非终点事件,反之亦然。已知在确定高维数据中的重大风险因素时,选择变数是实用的。虽然最近一些关于惩罚性变量选择的工作分别处理这些相互竞争的风险,而没有纳入它们之间的可能关联,但我们在疾病死亡模型中采用共同的弱点选择,在使用半参数危险回归模型来模拟复变效应的疾病死亡模型中进行变数选择。我们提议了一种破碎的适应性脊(BAR)处罚,以鼓励松散,并进行广泛的模拟研究,以便与其他流行方法进行比较。我们以特定方式进行变量选择,以便估计和选择潜在的风险因素和共变数效应,同时与研究中的每一事件相对应。我们使用模拟研究对组合效应以及拟议巴氏癌症程序的骨质特性进行调查。然后将拟议方法应用于科隆癌症研究产生的实体生命数据。