Survival analysis is a technique to predict the times of specific outcomes, and is widely used in predicting the outcomes for intensive care unit (ICU) trauma patients. Recently, deep learning models have drawn increasing attention in healthcare. However, there is a lack of deep learning methods that can model the relationship between measurements, clinical notes and mortality outcomes. In this paper we introduce BERTSurv, a deep learning survival framework which applies Bidirectional Encoder Representations from Transformers (BERT) as a language representation model on unstructured clinical notes, for mortality prediction and survival analysis. We also incorporate clinical measurements in BERTSurv. With binary cross-entropy (BCE) loss, BERTSurv can predict mortality as a binary outcome (mortality prediction). With partial log-likelihood (PLL) loss, BERTSurv predicts the probability of mortality as a time-to-event outcome (survival analysis). We apply BERTSurv on Medical Information Mart for Intensive Care III (MIMIC III) trauma patient data. For mortality prediction, BERTSurv obtained an area under the curve of receiver operating characteristic curve (AUC-ROC) of 0.86, which is an improvement of 3.6% over baseline of multilayer perceptron (MLP) without notes. For survival analysis, BERTSurv achieved a concordance index (C-index) of 0.7. In addition, visualizations of BERT's attention heads help to extract patterns in clinical notes and improve model interpretability by showing how the model assigns weights to different inputs.
翻译:生存分析是一种预测具体结果时间的技术,广泛用于预测强化护理单位创伤患者的结果。最近,深层次学习模型在卫生保健领域引起越来越多的关注。然而,缺乏能够模拟测量、临床说明和死亡率结果之间关系的深层次学习方法。在本文件中,我们引入了BERTSurv, 这是一种深层次学习生存框架,将变异器双向编码显示作为语言代表模型,用于非结构化临床说明,用于死亡率预测和生存分析。我们还将临床测量纳入BERTSurv。随着二进制跨渗透输入模式(BCE)的损失,BERTSurv可以预测死亡率,作为二进制结果(混合预测),但是,由于部分的日志(PL)损失,BERTSurv预测, 将死亡率的概率作为时间-时间-事件的结果(皮肤分析模型)。我们将BERTSurv用于医疗信息Mart, 用于强化治疗三期(MIMIC III) 创伤病人数据。对于死亡率预测,BERSPRv 的稳定性分析模型, BARSB-MLAL 的测算模型, 的测算的模型是BSLALAL的测测测测测测测的模型。