隐私保护联邦式差分隐私DNA模体发现 (Privacy-Preserving Federated Discovery of DNA Motifs with Differential Privacy) - 专知论文

会员服务 ·

0

模体发现 · 数据孤岛 · 差分 · 差分隐私 · 联邦学习 ·

2023 年 4 月 4 日

Privacy-Preserving Federated Discovery of DNA Motifs with Differential Privacy

翻译：隐私保护联邦式差分隐私DNA模体发现

Yao Chen,Wensheng Gan,Gengsen Huang,Yongdong Wu,Philip S. Yu

from arxiv, Preprint. 7 figures, 1 table

DNA motif discovery is an important issue in gene research, which aims to identify transcription factor binding sites (i.e., motifs) in DNA sequences to reveal the mechanisms that regulate gene expression. However, the phenomenon of data silos and the problem of privacy leakage have seriously hindered the development of DNA motif discovery. On the one hand, the phenomenon of data silos makes data collection difficult. On the other hand, the collection and use of DNA data become complicated and difficult because DNA is sensitive private information. In this context, how discovering DNA motifs under the premise of ensuring privacy and security and alleviating data silos has become a very important issue. Therefore, this paper proposes a novel method, namely DP-FLMD, to address this problem. Note that this is the first application of federated learning to the field of genetics research. The federated learning technique is used to solve the problem of data silos. It has the advantage of enabling multiple participants to train models together and providing privacy protection services. To address the challenges of federated learning in terms of communication costs, this paper applies a sampling method and a strategy for reducing communication costs to DP-FLMD. In addition, differential privacy, a privacy protection technique with rigorous mathematical proof, is also applied to DP-FLMD. Experiments on the DNA datasets show that DP-FLMD has high mining accuracy and runtime efficiency, and the performance of the algorithm is affected by some parameters.

翻译：DNA模体发现是基因研究的重要问题之一，旨在识别DNA序列中的转录因子结合位点（即模体），以揭示基因表达调节的机制。然而，数据孤岛现象和隐私泄露问题严重阻碍了DNA模体发现的发展。一方面，数据孤岛现象使数据采集困难。另一方面，DNA是敏感的私人信息，因此收集和使用DNA数据变得复杂而困难。在这种情况下，如何在确保隐私和安全的前提下发现DNA模体并减轻数据孤岛问题已成为一个非常重要的问题。因此，本文提出了一种新方法，即DP-FLMD，来解决问题。需要注意的是，DP-FLMD是联邦学习在基因学研究领域的首次应用。采用联邦学习技术解决数据孤岛问题，其优点是使多个参与者共同训练模型并提供隐私保护服务。为了解决联邦学习中的通信成本问题，本文采用了一种采样方法和通信成本降低策略应用于DP-FLMD。此外，本文还将差分隐私应用于DP-FLMD，这是一种具有严格数学证明的隐私保护技术。基于DNA数据集的实验表明，DP-FLMD具有较高的挖掘精度和运行时效率，并且算法的性能受到一些参数的影响。

0

相关内容

模体发现

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

专知会员服务

83+阅读 · 2023年5月1日

【MIT博士论文】联邦学习实用方法，143页pdf

【MIT博士论文】联邦学习实用方法，143页pdf

专知会员服务

66+阅读 · 2022年9月24日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

专知会员服务

17+阅读 · 2022年3月19日

联邦学习隐私保护研究进展

专知会员服务

94+阅读 · 2021年7月23日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

89+阅读 · 2020年12月2日

联邦学习安全与隐私保护研究综述

专知会员服务

127+阅读 · 2020年8月7日

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

专知会员服务

81+阅读 · 2020年5月20日

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

专知会员服务

15+阅读 · 2019年11月18日

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

专知会员服务

36+阅读 · 2019年11月15日

「联邦学习模型安全与隐私」研究进展

「联邦学习模型安全与隐私」研究进展

专知

5+阅读 · 2022年9月24日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

最新《联邦学习Federated Learning》报告，47页ppt

最新《联邦学习Federated Learning》报告，47页ppt

专知

47+阅读 · 2020年12月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

联邦学习或将助力IoT走出“数据孤岛”？

联邦学习或将助力IoT走出“数据孤岛”？

中国计算机学会

20+阅读 · 2019年3月16日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

专知

10+阅读 · 2018年4月22日

DNA碱基剪切修复在Gadd45a促CD4+T细胞DNA低甲基化中的作用及参与SLE发病的机制

国家自然科学基金

0+阅读 · 2014年12月31日

PTEN/β-catenin/Nanog干细胞通路调控鼻咽癌放疗抵抗的实验研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cupriavidus basilensis B-8 对木质素降解机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

可见光活化分子氧产生羟基自由基降解有机污染物的新途径

国家自然科学基金

0+阅读 · 2012年12月31日

多角度、多目标靶向防治HPV16阳性子宫颈癌及相关机制探讨

国家自然科学基金

0+阅读 · 2012年12月31日

基于牛XY精子差异表达基因的性控DNA疫苗研究

国家自然科学基金

0+阅读 · 2012年12月31日

鼻咽癌易感基因TNFRSF19致癌分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

果蔬拟除虫菊酯农药残留高效降解酶Cpde分子改造研究

国家自然科学基金

0+阅读 · 2008年12月31日

Differentially-Private Decision Trees with Probabilistic Robustness to Data Poisoning

Arxiv

0+阅读 · 2023年5月24日

Towards Fleet-wide Sharing of Wind Turbine Condition Information through Privacy-preserving Federated Learning

Arxiv

0+阅读 · 2023年5月23日

Fair Differentially Private Federated Learning Framework

Arxiv

0+阅读 · 2023年5月23日

On the (Im)Possibility of Estimating Various Notions of Differential Privacy

Arxiv

0+阅读 · 2023年5月23日

Privet: A Privacy-Preserving Vertical Federated Learning Service for Gradient Boosted Decision Tables

Arxiv

0+阅读 · 2023年5月22日

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning

Arxiv

0+阅读 · 2023年5月21日

On the Fairness Impacts of Private Ensembles Models

Arxiv

0+阅读 · 2023年5月19日

Free Lunch for Privacy Preserving Distributed Graph Learning

Arxiv

0+阅读 · 2023年5月19日

A Survey of Deep Learning for Scientific Discovery

A Survey of Deep Learning for Scientific Discovery

Arxiv

29+阅读 · 2020年3月26日

Advances and Open Problems in Federated Learning

Advances and Open Problems in Federated Learning

Arxiv

18+阅读 · 2019年12月10日

VIP会员

文章信息

相关主题

相关VIP内容

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

【2023新书】实用数据隐私:增强数据的隐私性和安全性，599页pdf

专知会员服务

83+阅读 · 2023年5月1日

【MIT博士论文】联邦学习实用方法，143页pdf

【MIT博士论文】联邦学习实用方法，143页pdf

专知会员服务

66+阅读 · 2022年9月24日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

专知会员服务

17+阅读 · 2022年3月19日

联邦学习隐私保护研究进展

专知会员服务

94+阅读 · 2021年7月23日

最新《联邦学习Federated Learning》报告，Federated Learning

最新《联邦学习Federated Learning》报告，Federated Learning

专知会员服务

89+阅读 · 2020年12月2日

联邦学习安全与隐私保护研究综述

专知会员服务

127+阅读 · 2020年8月7日

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

机器学习隐私综述论文，An Overview of Privacy in Machine Learning

专知会员服务

81+阅读 · 2020年5月20日

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

专知会员服务

15+阅读 · 2019年11月18日

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

专知会员服务

36+阅读 · 2019年11月15日

热门VIP内容

开通专知VIP会员享更多权益服务

小规模训练指南：打造世界级大语言模型的关键方法

无人机编队飞行：复杂环境中作战的策略、挑战与应用

大模型APP，AI时代第一个爆款

从数据中心视角出发的高效大语言模型训练综述

相关资讯

「联邦学习模型安全与隐私」研究进展

「联邦学习模型安全与隐私」研究进展

专知

5+阅读 · 2022年9月24日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

征稿 | International Joint Conference on Knowledge Graphs (IJCKG)

开放知识图谱

2+阅读 · 2022年5月20日

最新《联邦学习Federated Learning》报告，47页ppt

最新《联邦学习Federated Learning》报告，47页ppt

专知

47+阅读 · 2020年12月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

联邦学习或将助力IoT走出“数据孤岛”？

联邦学习或将助力IoT走出“数据孤岛”？

中国计算机学会

20+阅读 · 2019年3月16日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

【论文推荐】最新七篇图像检索相关论文—草图、Tie-Aware、场景图解析、叠加跨注意力机制、深度哈希、人群估计

专知

10+阅读 · 2018年4月22日

相关论文

Differentially-Private Decision Trees with Probabilistic Robustness to Data Poisoning

Arxiv

0+阅读 · 2023年5月24日

Towards Fleet-wide Sharing of Wind Turbine Condition Information through Privacy-preserving Federated Learning

Arxiv

0+阅读 · 2023年5月23日

Fair Differentially Private Federated Learning Framework

Arxiv

0+阅读 · 2023年5月23日

On the (Im)Possibility of Estimating Various Notions of Differential Privacy

Arxiv

0+阅读 · 2023年5月23日

Privet: A Privacy-Preserving Vertical Federated Learning Service for Gradient Boosted Decision Tables

Arxiv

0+阅读 · 2023年5月22日

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning

Arxiv

0+阅读 · 2023年5月21日

On the Fairness Impacts of Private Ensembles Models

Arxiv

0+阅读 · 2023年5月19日

Free Lunch for Privacy Preserving Distributed Graph Learning

Arxiv

0+阅读 · 2023年5月19日

A Survey of Deep Learning for Scientific Discovery

A Survey of Deep Learning for Scientific Discovery

Arxiv

29+阅读 · 2020年3月26日

Advances and Open Problems in Federated Learning

Advances and Open Problems in Federated Learning

Arxiv

18+阅读 · 2019年12月10日

相关基金

DNA碱基剪切修复在Gadd45a促CD4+T细胞DNA低甲基化中的作用及参与SLE发病的机制

国家自然科学基金

0+阅读 · 2014年12月31日

PTEN/β-catenin/Nanog干细胞通路调控鼻咽癌放疗抵抗的实验研究

国家自然科学基金

0+阅读 · 2013年12月31日

Cupriavidus basilensis B-8 对木质素降解机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

可见光活化分子氧产生羟基自由基降解有机污染物的新途径

国家自然科学基金

0+阅读 · 2012年12月31日

多角度、多目标靶向防治HPV16阳性子宫颈癌及相关机制探讨

国家自然科学基金

0+阅读 · 2012年12月31日

基于牛XY精子差异表达基因的性控DNA疫苗研究

国家自然科学基金

0+阅读 · 2012年12月31日

鼻咽癌易感基因TNFRSF19致癌分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

量子discord及其在量子计算中的研究

国家自然科学基金

1+阅读 · 2011年12月31日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

果蔬拟除虫菊酯农药残留高效降解酶Cpde分子改造研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员