利用成员推推推攻击成员对隐隐蔽语言模型的隐私风险进行量化 (Quantifying Privacy Risks of Masked Language Models Using Membership Inference Attacks) - 专知论文

会员服务 ·

0

掩码语言模型化 · 推断 · 语言模型化 · MoDELS · 掩码 ·

2022 年 11 月 4 日

Quantifying Privacy Risks of Masked Language Models Using Membership Inference Attacks

翻译：利用成员推推推攻击成员对隐隐蔽语言模型的隐私风险进行量化

Fatemehsadat Mireshghallah,Kartik Goyal,Archit Uniyal,Taylor Berg-Kirkpatrick,Reza Shokri

The wide adoption and application of Masked language models~(MLMs) on sensitive data (from legal to medical) necessitates a thorough quantitative investigation into their privacy vulnerabilities -- to what extent do MLMs leak information about their training data? Prior attempts at measuring leakage of MLMs via membership inference attacks have been inconclusive, implying the potential robustness of MLMs to privacy attacks. In this work, we posit that prior attempts were inconclusive because they based their attack solely on the MLM's model score. We devise a stronger membership inference attack based on likelihood ratio hypothesis testing that involves an additional reference MLM to more accurately quantify the privacy risks of memorization in MLMs. We show that masked language models are extremely susceptible to likelihood ratio membership inference attacks: Our empirical results, on models trained on medical notes, show that our attack improves the AUC of prior membership inference attacks from 0.66 to an alarmingly high 0.90 level, with a significant improvement in the low-error region: at 1% false positive rate, our attack is 51X more powerful than prior work.

翻译：广泛采用和应用关于敏感数据(从法律到医疗)的蒙面语言模型~(MLMs),需要对隐私脆弱性进行彻底的定量调查 -- -- MLMs泄漏有关其培训数据的信息的程度有多大?以前试图通过会员推论攻击测量MLMs渗漏的可能性是没有结果的,这意味着MLMs对隐私攻击具有潜在的稳健性。在这项工作中,我们假设先前的尝试是没有结果的,因为它们完全以MLM模型的得分作为攻击的依据。我们根据概率比假设测试设计了更强烈的会员推论攻击,这需要更多参考MLM(M)来更准确地量化MLMs中记忆化的隐私风险。我们表明,蒙面语言模型极有可能受到会员推论攻击的可能性:我们关于医学笔记培训模型的经验结果表明,我们的攻击使AUC公司先前的推论攻击从0.66增加到惊人的0.90级,低eror地区的情况有了显著的改善:以1%的假正率,我们的攻击比以前的工作更强大。

0

相关内容

掩码语言模型化

掩码语言模型化

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【重磅】2021年IEEE Fellow出炉！ 282位新晋升会士！七十多位华人当选！

专知会员服务

23+阅读 · 2020年11月25日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

TRAF3IP3调控T细胞活性与肿瘤免疫的分子机制

国家自然科学基金

0+阅读 · 2016年12月31日

CSP I-plus 修饰的内皮抑制素靶向抑制肝细胞癌转移的研究

国家自然科学基金

0+阅读 · 2015年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Hippo信号通路调控间充质干细胞向ARDS肺泡上皮细胞分化的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ipr1基因介导巨噬细胞凋亡的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型抑癌基因ZHX2调节脂质代谢参与非酒精性脂肪肝的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

TWIST在胃癌多药耐药中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

长链非编码RNA RP3-337D23.3促进非吸烟者肺腺癌侵袭转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

删除CD4+CD25+Treg协同A549-RNA-DC疫苗诱导抗肿瘤免疫应答作用的研究

国家自然科学基金

0+阅读 · 2011年12月31日

折叠式人工玻璃体缓释PKCα25233;制剂防治PVR的研究

国家自然科学基金

0+阅读 · 2009年12月31日

On the Risks of Collecting Multidimensional Data Under Local Differential Privacy

Arxiv

0+阅读 · 2022年12月28日

Efficient Graph Neural Network Inference at Large Scale

Arxiv

0+阅读 · 2022年12月28日

A Labelled Sample Compression Scheme of Size at Most Quadratic in the VC Dimension

Arxiv

0+阅读 · 2022年12月28日

Controllable Data Generation by Deep Learning: A Review

Arxiv

15+阅读 · 2022年7月19日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

On the Opportunities and Risks of Foundation Models

Arxiv

30+阅读 · 2021年8月18日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

Graph Signal Processing -- Part I: Graphs, Graph Spectra, and Spectral Clustering

Arxiv

14+阅读 · 2019年8月12日

VIP会员

文章信息

相关主题

掩码语言模型化

语言模型化

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【重磅】2021年IEEE Fellow出炉！ 282位新晋升会士！七十多位华人当选！

专知会员服务

23+阅读 · 2020年11月25日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《人与智能体在系统工程建模语言V2任务中的性能表现：基于用户中心化的评估方法》308页

《数据安全国家标准体系（2025版）》征求意见稿

AlphaMosaic：人工智能赋能的作战管理系统

《军事行动中通信平台的战略价值：提升战术效能与作战优势》

相关资讯

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

On the Risks of Collecting Multidimensional Data Under Local Differential Privacy

Arxiv

0+阅读 · 2022年12月28日

Efficient Graph Neural Network Inference at Large Scale

Arxiv

0+阅读 · 2022年12月28日

A Labelled Sample Compression Scheme of Size at Most Quadratic in the VC Dimension

Arxiv

0+阅读 · 2022年12月28日

Controllable Data Generation by Deep Learning: A Review

Arxiv

15+阅读 · 2022年7月19日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

Towards Out-Of-Distribution Generalization: A Survey

Arxiv

38+阅读 · 2021年8月31日

On the Opportunities and Risks of Foundation Models

Arxiv

30+阅读 · 2021年8月18日

Privacy and Robustness in Federated Learning: Attacks and Defenses

Arxiv

35+阅读 · 2020年12月7日

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Arxiv

12+阅读 · 2020年6月23日

Graph Signal Processing -- Part I: Graphs, Graph Spectra, and Spectral Clustering

Arxiv

14+阅读 · 2019年8月12日

相关基金

TRAF3IP3调控T细胞活性与肿瘤免疫的分子机制

国家自然科学基金

0+阅读 · 2016年12月31日

CSP I-plus 修饰的内皮抑制素靶向抑制肝细胞癌转移的研究

国家自然科学基金

0+阅读 · 2015年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

Hippo信号通路调控间充质干细胞向ARDS肺泡上皮细胞分化的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Ipr1基因介导巨噬细胞凋亡的作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

新型抑癌基因ZHX2调节脂质代谢参与非酒精性脂肪肝的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

TWIST在胃癌多药耐药中的作用及分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

长链非编码RNA RP3-337D23.3促进非吸烟者肺腺癌侵袭转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

删除CD4+CD25+Treg协同A549-RNA-DC疫苗诱导抗肿瘤免疫应答作用的研究

国家自然科学基金

0+阅读 · 2011年12月31日

折叠式人工玻璃体缓释PKCα25233;制剂防治PVR的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员