Existing prognostic tools mainly focus on predicting the risk of mortality among patients with coronavirus disease 2019. However, clinical evidence suggests that COVID-19 can result in non-mortal complications that affect patient prognosis. To support patient risk stratification, we aimed to develop a prognostic system that predicts complications common to COVID-19. In this retrospective study, we used data collected from 3,352 COVID-19 patient encounters admitted to 18 facilities between April 1 and April 30, 2020, in Abu Dhabi (AD), UAE. The hospitals were split based on geographical proximity to assess for our proposed system's learning generalizability, AD Middle region and AD Western & Eastern regions, A and B, respectively. Using data collected during the first 24 hours of admission, the machine learning-based prognostic system predicts the risk of developing any of seven complications during the hospital stay. The complications include secondary bacterial infection, AKI, ARDS, and elevated biomarkers linked to increased patient severity, including d-dimer, interleukin-6, aminotransferases, and troponin. During training, the system applies an exclusion criteria, hyperparameter tuning, and model selection for each complication-specific model. The system achieves good accuracy across all complications and both regions. In test set A (587 patient encounters), the system achieves 0.91 AUROC for AKI and >0.80 AUROC for most of the other complications. In test set B (225 patient encounters), the respective system achieves 0.90 AUROC for AKI, elevated troponin, and elevated interleukin-6, and >0.80 AUROC for most of the other complications. The best performing models, as selected by our system, were mainly gradient boosting models and logistic regression. Our results show that a data-driven approach using machine learning can predict the risk of such complications with high accuracy.
翻译:现有预测工具主要侧重于预测2019年冠状病毒疾病患者的死亡率风险。然而,临床证据表明,COVID-19可能导致影响患者预感的非死亡并发症。为支持患者风险分级,我们的目标是开发一个预测系统,预测COVID-19常见并发症。在本次回顾研究中,我们使用了从3 352 COVID-19患者于2020年4月1日至30日在阿布扎比(AD)的18个设施中接收的数据。医院根据地理距离进行分裂,以评估我们提议的系统是否具有通用性、AD中区和AD West & East区域、A和B。利用在入院头24小时内收集的数据,基于机器学习的预测系统预测在住院期间出现任何7种并发症的风险。并发症包括二级细菌感染、AKIKI、ARDDS、与病人更严重程度相关的生物标志,包括d-dimer、Interleuk-6、氨基转移酶、A和Arentral-rental 等系统在A系统内部测试中,每个测试系统都采用A-rmaleximaleximaleximal 数据,以及Bstation Arevalemissemex 测试系统,每个测试系统都显示。