Early prediction of patients at risk of clinical deterioration can help physicians intervene and alter their clinical course towards better outcomes. In addition to the accuracy requirement, early warning systems must make the predictions early enough to give physicians enough time to intervene. Interpretability is also one of the challenges when building such systems since being able to justify the reasoning behind model decisions is desirable in clinical practice. In this work, we built an interpretable model for the early prediction of various adverse clinical events indicative of clinical deterioration. The model is evaluated on two datasets and four clinical events. The first dataset is collected in a predominantly COVID-19 positive population at Stony Brook Hospital. The second dataset is the MIMIC III dataset. The model was trained to provide early warning scores for ventilation, ICU transfer, and mortality prediction tasks on the Stony Brook Hospital dataset and to predict mortality and the need for vasopressors on the MIMIC III dataset. Our model first separates each feature into multiple ranges and then uses logistic regression with lasso penalization to select the subset of ranges for each feature. The model training is completely automated and doesn't require expert knowledge like other early warning scores. We compare our model to the Modified Early Warning Score (MEWS) and quick SOFA (qSOFA), commonly used in hospitals. We show that our model outperforms these models in the area under the receiver operating characteristic curve (AUROC) while having a similar or better median detection time on all clinical events, even when using fewer features. Unlike MEWS and qSOFA, our model can be entirely automated without requiring any manually recorded features. We also show that discretization improves model performance by comparing our model to a baseline logistic regression model.
翻译:除了准确性要求外,早期预警系统必须提前作出预测,以便让医生有足够的时间进行干预。在临床实践中,解释性也是在建立这种系统时所面临的挑战之一,因为可以证明示范决定背后的推理是可取的。在这项工作中,我们建立了一个可解释的模型,用于早期预测各种不良临床事件,表明临床恶化。模型用两个数据集和四个临床事件来评估。第一个数据集是在斯托尼布罗克医院的以COVID-19为主的临床阳性人群中收集的。第二个数据集是完全的MIMIMICIII数据集。该模型在建立这种系统时也是挑战之一,因为能够证明示范决定临床恶化的风险。在MIMIC III数据集中,我们为早期预测各种不良临床事件的早期预测建立了一个可解释模型。我们的第一个模型把我们的模型分成不同的模型分为多个区域,然后用任何变价的逻辑回归法来选择每种地段的序列。模型是完全自动化的,并且需要ASMA III III 数据集,用于通风、电路段、电路路路段,而我们又使用其他预警工具。我们使用这些模型,可以使用这些模型,然后用这些模型,然后用更低模型,然后用更小的模型,然后用更精确的模型,然后用更精确的模型,然后用更精确的模型来选择测路路路路路路路路路路段,然后用。我们用一个比。