Genome-wide Association Studies (GWASes) identify genomic variations that are statistically associated with a trait, such as a disease, in a group of individuals. Unfortunately, careless sharing of GWAS statistics might give rise to privacy attacks. Several works attempted to reconcile secure processing with privacy-preserving releases of GWASes. However, we highlight that these approaches remain vulnerable if GWASes utilize overlapping sets of individuals and genomic variations. In such conditions, we show that even when relying on state-of-the-art techniques for protecting releases, an adversary could reconstruct the genomic variations of up to 28.6% of participants, and that the released statistics of up to 92.3% of the genomic variations would enable membership inference attacks. We introduce I-GWAS, a novel framework that securely computes and releases the results of multiple possibly interdependent GWASes. I-GWAScontinuously releases privacy-preserving and noise-free GWAS results as new genomes become available.
翻译:全基因组协会研究(GWAS)确定一组人中与疾病等特征有统计联系的基因组变异。不幸的是,不小心分享GWAS统计数据可能导致隐私攻击。一些工作试图协调安全处理与GWAS的隐私保护释放。然而,我们强调,如果GWAS使用重叠的一组个人和基因变异,这些办法仍然很脆弱。在这种情况下,我们表明,即使依靠最先进的技术来保护释放,对手也可以重建28.6%的参与者的基因变异,而所公布的高达92.3%的基因变异统计数据将使人们能够推断成成员的攻击。我们引入了I-GWAS,这是一个新的框架,可以安全地比较和释放多种可能相互依存的GWAS的结果。在新的基因组出现时,I-GWAS持续不懈地释放保护隐私和无噪音的GWAS结果。