Deep models trained through maximum likelihood have achieved state-of-the-art results for survival analysis. Despite this training scheme, practitioners evaluate models under other criteria, such as binary classification losses at a chosen set of time horizons, e.g. Brier score (BS) and Bernoulli log likelihood (BLL). Models trained with maximum likelihood may have poor BS or BLL since maximum likelihood does not directly optimize these criteria. Directly optimizing criteria like BS requires inverse-weighting by the censoring distribution. However, estimating the censoring model under these metrics requires inverse-weighting by the failure distribution. The objective for each model requires the other, but neither are known. To resolve this dilemma, we introduce Inverse-Weighted Survival Games. In these games, objectives for each model are built from re-weighted estimates featuring the other model, where the latter is held fixed during training. When the loss is proper, we show that the games always have the true failure and censoring distributions as a stationary point. This means models in the game do not leave the correct distributions once reached. We construct one case where this stationary point is unique. We show that these games optimize BS on simulations and then apply these principles on real world cancer and critically-ill patient data.
翻译:通过最大可能性培训的深层模型已经达到了最新的生存分析结果。尽管有了这一培训计划,实践者还是根据其他标准评估模型,如在选定的时间范围,例如Brier评分(BS)和Bernoulli日志(BLL),在选定的一套时间范围,例如Brier评分(BBS)和BNURL 日志(BLL),在选定的一组时间范围,例如Brier评分(BLL)和Bernoulli日志(BLL),在最大可能性下培训的模型可能有较差的BS或BLLL,因为最大可能性不能直接优化这些标准。BS等直接优化标准需要检查分布的反比重。然而,根据这些指标估算审查模式需要用失败分布进行反加权。每个模型都需要用其他方法来进行反加权。每个模型的目标都需要用其他方法来进行二分级分类,但两者都不清楚。为了解决这一难题,我们引入了逆重度生存游戏。在这些游戏中,每种模型都是根据重重估的模型,然后将这些模型和精确地进行模拟。我们所选的世界。我们对这些模型进行模拟。