Early prediction of mortality and length of stay(LOS) of a patient is vital for saving a patient's life and management of hospital resources. Availability of electronic health records(EHR) makes a huge impact on the healthcare domain and there has seen several works on predicting clinical problems. However, many studies did not benefit from the clinical notes because of the sparse, and high dimensional nature. In this work, we extract medical entities from clinical notes and use them as additional features besides time-series features to improve our predictions. We propose a convolution based multimodal architecture, which not only learns effectively combining medical entities and time-series ICU signals of patients, but also allows us to compare the effect of different embedding techniques such as Word2vec, FastText on medical entities. In the experiments, our proposed method robustly outperforms all other baseline models including different multimodal architectures for all clinical tasks. The code for the proposed method is available at https://github.com/tanlab/ConvolutionMedicalNer.
翻译:早期预测病人的死亡率和停留时间(LOS)对于挽救病人的生命和管理医院资源至关重要。电子健康记录(EHR)的提供对保健领域产生了巨大影响,并出现了若干预测临床问题的工作。然而,许多研究没有从临床记录中受益,因为病情稀少和高度多维性质。在这项工作中,我们从临床记录中提取医疗实体,并把它们作为改进我们预测的时间序列特征之外的附加特征。我们提议了一个基于变革的多式联运结构,不仅学会有效地将医疗实体和病人的时间序列ICU信号结合起来,而且还使我们能够比较诸如Word2vec等不同嵌入技术对医疗实体的影响。在实验中,我们提出的方法大大超越了所有其他基线模型,包括所有临床任务的不同多式联运结构。拟议方法的代码可在https://github.com/tanlab/ConvolumicalMedicalNer查阅。