Harrel's concordance index is a commonly used discrimination metric for survival models, particularly for models where the relative ordering of the risk of individuals is time-independent, such as the proportional hazards model. There are several suggestions, but no consensus, on how it could be extended to models where risk varies over time, e.g.\ in case of crossing hazard rates. We show that, in the limit, concordance is maximized if and only if the risk score is concordant with the hazard rate, in the sense that for a comparable pair where the first event time is observed, the risk score is concordant with the hazard rate at this first event time. Thus, we suggest using the hazard rate as the risk score when calculating concordance. Through simulations, we demonstrate situations in which other concordance indices can lead to incorrect models being selected over a true model, justifying the use of our suggested risk prediction in both model selection and in loss functions in, e.g., machine learning models.
翻译:哈雷尔的和谐指数是生存模型中常用的一种歧视衡量标准,特别是个人风险相对顺序取决于时间的模型,如比例危害模型。对于如何将其推广到风险随时间而变化的模型,例如,在跨度危险率方面,有好几项建议,但没有达成共识。我们表明,在极限情况下,只有在风险分数与危险率一致的情况下,才最大限度地实现和谐,也就是说,对于观察到第一次事故的可比对子而言,风险分数与第一次事故发生时的危险率一致。因此,我们建议使用危险率作为计算一致性时的风险分数。通过模拟,我们证明其他一致指数可能导致在真实模型中选择不正确的模型,证明在模型选择和机器学习模型中损失功能方面使用我们所建议的风险预测是合理的。