Generating Electronic Health Records with Multiple Data Types and Constraints

Sharing electronic health records (EHRs) on a large scale may lead to privacy intrusions. Recent research has shown that risks may be mitigated by simulating EHRs through generative adversarial network (GAN) frameworks. Yet the methods developed to date are limited because they 1) focus on generating data of a single type (e.g., diagnosis codes), neglecting other data types (e.g., demographics, procedures or vital signs) and 2) do not represent constraints between features. In this paper, we introduce a method to simulate EHRs composed of multiple data types by 1) refining the GAN model, 2) accounting for feature constraints, and 3) incorporating utility measures for such generation tasks. The findings over 770K EHRs from Vanderbilt University Medical Center demonstrate that our model achieved higher data utilities in retaining the basic statistics, interdimensional correlation, structural properties and frequent association rules from real data. Importantly, these were done without sacrificing privacy.
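The abstract names two ideas that are easy to make concrete: constraints between features, and utility measured by how well basic statistics are retained. The sketch below is illustrative only and is not the paper's implementation. The record layout, the feature names (`male`, `pregnancy_dx`, `hypertension_dx`), the example constraint, and both helper functions are assumptions introduced here. It checks how often a set of synthetic records violates a simple cross-feature constraint, and compares per-feature prevalence between real and synthetic records.

```python
from typing import Callable, Dict, List

# Hypothetical layout: each record maps a binary feature name to 0 or 1.
Record = Dict[str, int]


def prevalence(records: List[Record], feature: str) -> float:
    """Fraction of records in which a binary feature is set."""
    return sum(r[feature] for r in records) / len(records)


def dimension_wise_gap(real: List[Record], synthetic: List[Record],
                       features: List[str]) -> float:
    """Mean absolute difference in per-feature prevalence (lower is better)."""
    gaps = [abs(prevalence(real, f) - prevalence(synthetic, f)) for f in features]
    return sum(gaps) / len(gaps)


def violation_rate(records: List[Record],
                   constraint: Callable[[Record], bool]) -> float:
    """Fraction of records for which the constraint does not hold."""
    return sum(not constraint(r) for r in records) / len(records)


def no_pregnancy_code_for_males(r: Record) -> bool:
    """Assumed example constraint: a male record carries no pregnancy diagnosis."""
    return not (r["male"] == 1 and r["pregnancy_dx"] == 1)


FEATURES = ["male", "pregnancy_dx", "hypertension_dx"]

# Toy data, invented for illustration.
real = [
    {"male": 1, "pregnancy_dx": 0, "hypertension_dx": 1},
    {"male": 0, "pregnancy_dx": 1, "hypertension_dx": 0},
    {"male": 0, "pregnancy_dx": 0, "hypertension_dx": 1},
    {"male": 1, "pregnancy_dx": 0, "hypertension_dx": 0},
]
synthetic = [
    {"male": 1, "pregnancy_dx": 1, "hypertension_dx": 1},  # violates the constraint
    {"male": 0, "pregnancy_dx": 1, "hypertension_dx": 0},
    {"male": 0, "pregnancy_dx": 0, "hypertension_dx": 0},
    {"male": 1, "pregnancy_dx": 0, "hypertension_dx": 1},
]

if __name__ == "__main__":
    print("violation rate:", violation_rate(synthetic, no_pregnancy_code_for_males))
    print("dimension-wise gap:", dimension_wise_gap(real, synthetic, FEATURES))
```

In this toy data, one of the four synthetic records breaks the constraint, and only the pregnancy feature differs in prevalence between the two sets. A generator that accounts for feature constraints would aim to push the violation rate toward zero while keeping the statistical gap small.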
