In this article, we propose a novel model, PrivGenDB, for securely storing and efficiently conducting different queries on genomic data outsourced to an honest-but-curious cloud server. To instantiate PrivGenDB, we use searchable symmetric encryption (SSE) to ensure confidentiality while providing the required functionality. To the best of our knowledge, PrivGenDB construction is the first SSE-based approach ensuring the confidentiality of shared Single Nucleotide Polymorphism (SNP)-phenotype data through encryption while making the computation/query process efficient and scalable for biomedical research and care. It supports a variety of query types on genomic data, including count queries, Boolean queries, and k'-out-of-k match queries. Finally, the PrivGenDB model not only can handle the dataset containing both genotype and phenotype, but it also supports storing and managing other metadata like gender and ethnicity privately. Computer evaluations on a dataset with 5,000 records and 1,000 SNPs demonstrate that a count/Boolean query and a k'-out-of-k match query over 40 SNPs take approximately 4.3s and 86.4{\mu}s, respectively, that outperforms the existing schemes.
翻译:在本篇文章中,我们提出了一个新颖的模型,即PriivGenDB, 用于安全地储存和高效地对外包给诚实的但有说服力的云服务器的基因组数据进行不同的查询。对于PrivGenDB,我们使用可搜索的对称加密(SSE)来确保保密,同时提供所需的功能。根据我们的最佳知识,PriivGenDB的建设是第一个基于SSESE的方法,通过加密确保共享的单一核糖化多元形态(SNP)-phenod型数据的保密性,同时使计算/查询过程高效和可扩缩到生物医学研究和护理中。它支持关于基因组数据的各种查询类型,包括计数查询、Boolean查询和K'out-out-k匹配查询。最后,PriivGenDB模型不仅能够处理包含基因型和phone类型的数据数据集,而且还支持存储和管理诸如性别和种族私有等其他元数据。一个数据集的计算机评价,有5,000个记录和1,000个SNPSNPS的计算机评估显示,一个计数/Boolean查询和大约40个KMyS-moxS-rops。