Sepsis is a life-threatening disease with high morbidity, mortality and healthcare costs. The early prediction and administration of antibiotics and intravenous fluids is considered crucial for the treatment of sepsis and can save potentially millions of lives and billions in health care costs. Professional clinical care practitioners have proposed clinical criterion which aid in early detection of sepsis; however, performance of these criterion is often limited. Clinical text provides essential information to estimate the severity of the sepsis in addition to structured clinical data. In this study, we explore how clinical text can complement structured data towards early sepsis prediction task. In this paper, we propose multi modal model which incorporates both structured data in the form of patient measurements as well as textual notes on the patient. We employ state-of-the-art NLP models such as BERT and a highly specialized NLP model in Amazon Comprehend Medical to represent the text. On the MIMIC-III dataset containing records of ICU admissions, we show that by using these notes, one achieves an improvement of 6.07 points in a standard utility score for Sepsis prediction and 2.89% in AUROC score. Our methods significantly outperforms a clinical criteria suggested by experts, qSOFA, as well as the winning model of the PhysioNet Computing in Cardiology Challenge for predicting Sepsis.
翻译:早期预测和服用抗生素和静脉注射液被认为是治疗败血症的关键,可以挽救数百万人的生命和数十亿医疗费用; 专业临床护理从业人员提出了有助于早期发现败血症的临床标准; 然而,这些标准的绩效往往有限; 临床文本除了提供结构化的临床数据外,还提供了评估败血症严重性的基本信息; 在这次研究中,我们探讨了临床文本如何补充结构化数据以完成早期败血症预测任务。 在本文中,我们提出了多模式模型,既包括结构化数据,又可以挽救数百万人的生命和数十亿医疗费用; 专业临床护理从业人员提出了有助于早期检测败血症的临床标准; 然而,这些标准的性能往往有限; 临床文本提供了基本信息,用以估计败血症的严重程度; 在包含重症综合症患者入院记录的MIMI-III数据集中,我们通过这些说明,改进了Sepsi预测标准中的6.07点,在AUROC中增加了2.89%。 我们采用最先进的NLP模型,作为SEVALFA的临床预测标准,我们提出了在CUDA的模型中大大超越了CUDA的模型。