Early diagnosis of disease can lead to improved health outcomes, including higher survival rates and lower treatment costs. With the massive amount of information available in electronic health records (EHRs), there is great potential to use machine learning (ML) methods to model disease progression aimed at early prediction of disease onset and other outcomes. In this work, we employ recent innovations in neural ODEs combined with rich semantic embeddings of clinical codes to harness the full temporal information of EHRs. We propose ICE-NODE (Integration of Clinical Embeddings with Neural Ordinary Differential Equations), an architecture that temporally integrates embeddings of clinical codes and neural ODEs to learn and predict patient trajectories in EHRs. We apply our method to the publicly available MIMIC-III and MIMIC-IV datasets, and we find improved prediction results compared to state-of-the-art methods, specifically for clinical codes that are not frequently observed in EHRs. We also show that ICE-NODE is more competent at predicting certain medical conditions, like acute renal failure, pulmonary heart disease and birth-related problems, where the full temporal information could provide important information. Furthermore, ICE-NODE is also able to produce patient risk trajectories over time that can be exploited for further detailed predictions of disease evolution.
翻译:疾病早期诊断可以改善健康结果,包括提高存活率和降低治疗费用。由于电子健康记录(EHRs)提供了大量信息,因此极有可能使用机器学习(ML)方法模拟疾病进展,以早期预测疾病发病和其他结果。在这项工作中,我们运用了神经组织的最新创新,加上丰富的临床代码的语义嵌入,以利用EHRs的全部时间信息。我们提议ICE-NODE(将临床嵌入与神经普通差异等同相结合),这一结构在时间上结合了临床代码和神经值的嵌入,以学习和预测EHRs中的病人轨迹。我们运用了我们的方法,用于公开提供的MIMIM-III和MIMIM-IV数据集,我们找到了更好的预测结果,与最新医学方法相比,特别是临床代码在EHRs中并不经常观察到。我们还表明,ICE-NODE更有能力预测某些医疗条件,如急性肾脏衰竭、Pulual des 能够对IMDE疾病进行全面预测,从而产生与IMDE相关的时间风险。