Electronic health records (EHRs) contain patients' heterogeneous data that are collected from medical providers involved in the patient's care, including medical notes, clinical events, laboratory test results, symptoms, and diagnoses. In the field of modern healthcare, predicting whether patients would experience any risks based on their EHRs has emerged as a promising research area, in which artificial intelligence (AI) plays a key role. To make AI models practically applicable, it is required that the prediction results should be both accurate and interpretable. To achieve this goal, this paper proposed a label-dependent and event-guided risk prediction model (LERP) to predict the presence of multiple disease risks by mainly extracting information from unstructured medical notes. Our model is featured in the following aspects. First, we adopt a label-dependent mechanism that gives greater attention to words from medical notes that are semantically similar to the names of risk labels. Secondly, as the clinical events (e.g., treatments and drugs) can also indicate the health status of patients, our model utilizes the information from events and uses them to generate an event-guided representation of medical notes. Thirdly, both label-dependent and event-guided representations are integrated to make a robust prediction, in which the interpretability is enabled by the attention weights over words from medical notes. To demonstrate the applicability of the proposed method, we apply it to the MIMIC-III dataset, which contains real-world EHRs collected from hospitals. Our method is evaluated in both quantitative and qualitative ways.
翻译:电子健康记录(EHRs)包含从参与病人护理的医疗提供者那里收集的病人不同种类的数据,包括医疗说明、临床事件、实验室测试结果、症状和诊断。在现代保健领域,预测病人是否会经历基于其EHRs的任何风险,这已成为一个前景良好的研究领域,人工智能(AI)在其中发挥着关键作用。要使AI模型实际适用,就需要预测结果既准确又可解释。为实现这一目标,本文件建议采用一个基于标签和事件指导的风险预测模型(LERP),主要通过从非结构化医疗说明中提取信息,预测多种疾病风险的存在。在现代保健领域,我们的模式表现在以下几个方面。首先,我们采用一个基于标签的机制,更多地注意医学说明中的词语,这些词在性质上与风险标签标签标签标签标签标签标签(AI)名称相似。第二,临床事件(如治疗和药物)还可以显示病人的健康状况,我们利用事件信息并利用这些模型来预测多种疾病风险的存在,主要从非结构化医学说明中产生事件指导性的数据。第三,通过标签和医学说明的方式来解释准确性地解释医学说明。