Leading up to August 2020, COVID-19 has spread to almost every country in the world, causing millions of infected and hundreds of thousands of deaths. In this paper, we first verify the assumption that clinical variables could have time-varying effects on COVID-19 outcomes. Then, we develop a temporal stratification approach to make daily predictions on patients' outcome at the end of hospital stay. Training data is segmented by the remaining length of stay, which is a proxy for the patient's overall condition. Based on this, a sequence of predictive models are built, one for each time segment. Thanks to the publicly shared data, we were able to build and evaluate prototype models. Preliminary experiments show 0.98 AUROC, 0.91 F1 score and 0.97 AUPR on continuous deterioration prediction, encouraging further development of the model as well as validations on different datasets. We also verify the key assumption which motivates our method. Clinical variables could have time-varying effects on COVID-19 outcomes. That is to say, the feature importance of a variable in the predictive model varies at different disease stages.
翻译:到2020年8月为止,COVID-19已经扩散到世界上几乎每一个国家,造成数百万人感染和数十万人死亡。在本文中,我们首先核实临床变量可能对COVID-19结果产生时间变化效应的假设。然后,我们制定了一种时间分层法,每天预测住院期结束时病人的结果。培训数据被剩余停留时间的长度分割,这是病人总体状况的替代物。在此基础上,建立了一系列预测模型,每个时段一个时段。由于公开分享的数据,我们得以建立和评估原型模型。初步实验显示0.98 AUROC、0.91 F1分和0.97 AUPR关于持续恶化预测的0.98 AUROC、0.91 F1分和0.97 AUPR,鼓励进一步发展模型,以及验证不同的数据集。我们还核查了我们方法所依据的关键假设。临床变量可能对COVID-19结果产生时间变化的影响。也就是说,预测模型中变量在不同疾病阶段的特点。