Making the most use of abundant information in electronic health records (EHR) is rapidly becoming an important topic in the medical domain. Recent work presented a promising framework that embeds entire features in raw EHR data regardless of its form and medical code standards. The framework, however, only focuses on encoding EHR with minimal preprocessing and fails to consider how to learn efficient EHR representation in terms of computation and memory usage. In this paper, we search for a versatile encoder not only reducing the large data into a manageable size but also well preserving the core information of patients to perform diverse clinical tasks. We found that hierarchically structured Convolutional Neural Network (CNN) often outperforms the state-of-the-art model on diverse tasks such as reconstruction, prediction, and generation, even with fewer parameters and less training time. Moreover, it turns out that making use of the inherent hierarchy of EHR data can boost the performance of any kind of backbone models and clinical tasks performed. Through extensive experiments, we present concrete evidence to generalize our research findings into real-world practice. We give a clear guideline on building the encoder based on the research findings captured while exploring numerous settings.
翻译:电子健康记录(EHR)中大量信息的利用正在迅速成为医疗领域的一个重要主题。最近的工作提出了一个充满希望的框架,将全部特征嵌入原始EHR数据,而不论其形式和医学代码标准如何。然而,该框架只侧重于以最低的预处理方式对EHR进行编码,而没有考虑如何在计算和记忆使用方面学习有效的EHR代表性。在本文件中,我们寻找一个多功能的编码器,不仅将大型数据缩小到可以控制的规模,而且很好地保存病人的核心信息,以便执行不同的临床任务。我们发现,分级结构的Convolual Neural网络(CNN)往往超越了重建、预测和生成等各种任务的最新模型,即使参数更少,培训时间也更少。此外,我们发现,利用EHR数据固有的等级可以提高任何类型主干模型和临床任务的业绩。通过广泛的实验,我们提出了具体的证据,将我们的研究成果归纳为现实世界实践。我们给出了在探索无数环境时根据所捕捉到的研究结果构建编码器的明确准则。</s>