Real-time individual endpoint prediction has always been a challenging task but of great clinic utility for both patients and healthcare providers. With 6,879 chronic kidney disease stage 4 (CKD4) patients as a use case, we explored the feasibility and performance of gated recurrent units with decay that models Weibull probability density function (GRU-D-Weibull) as a semi-parametric longitudinal model for real-time individual endpoint prediction. GRU-D-Weibull has a maximum C-index of 0.77 at 4.3 years of follow-up, compared to 0.68 achieved by competing models. The L1-loss of GRU-D-Weibull is ~66% of XGB(AFT), ~60% of MTLR, and ~30% of AFT model at CKD4 index date. The average absolute L1-loss of GRU-D-Weibull is around one year, with a minimum of 40% Parkes serious error after index date. GRU-D-Weibull is not calibrated and significantly underestimates true survival probability. Feature importance tests indicate blood pressure becomes increasingly important during follow-up, while eGFR and blood albumin are less important. Most continuous features have non-linear/parabola impact on predicted survival time, and the results are generally consistent with existing knowledge. GRU-D-Weibull as a semi-parametric temporal model shows advantages in built-in parameterization of missing, native support for asynchronously arrived measurement, capability of output both probability and point estimates at arbitrary time point for arbitrary prediction horizon, improved discrimination and point estimate accuracy after incorporating newly arrived data. Further research on its performance with more comprehensive input features, in-process or post-process calibration are warranted to benefit CKD4 or alike terminally-ill patients.
翻译:实时个人端点预测一直是一项具有挑战性的任务,但对于患者和医疗保健提供者来说都具有巨大的临床效用。 以6 879个慢性肾病第4阶段(CKD4)患者为使用案例,我们探索了有衰变的封闭性经常性单位的可行性和性能,这些单位的模型是Wibull概率密度功能(GRU-D-Weibull),作为实时个人端点预测的半参数长视模型。 GRU-D-Weibull在后续跟踪中的最大Cindex为0.77,而相竞争模型为0.68。 GRU-D-Webull的L1损失是XGB(AFT)点的~66%,MTLR的~60%,以及CKFT模型在CKD4指数日期的半分数。 GRU-D-Webbull的平均绝对损失是一年左右的模型,在指数日期后至少为40%的Parkes 严重错误。 GRU-D-Webbul没有校准, 而G-web-bill的测算的Lisal-al-al-lation-lation-al-al-lation 的测值是准确值的Lislation-lation-lation-lation-de dislight dislation 和精确值的精确点的精确点的精确值估算值估算值的精确值的精确值估算值估算值, 和精确值的精确值的精确值估算值是越来越算值, 。 和不断算值的测算算值 。