半疏漏:对半监督学习进行的成员推论攻击 (Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning) - 专知论文

会员服务 ·

0

SSL · Learning · 推断 · Performer · 早停 ·

2022 年 7 月 25 日

Semi-Leak: Membership Inference Attacks Against Semi-supervised Learning

翻译：半疏漏:对半监督学习进行的成员推论攻击

Xinlei He,Hongbin Liu,Neil Zhenqiang Gong,Yang Zhang

from arxiv, Accepted to ECCV 2022

Semi-supervised learning (SSL) leverages both labeled and unlabeled data to train machine learning (ML) models. State-of-the-art SSL methods can achieve comparable performance to supervised learning by leveraging much fewer labeled data. However, most existing works focus on improving the performance of SSL. In this work, we take a different angle by studying the training data privacy of SSL. Specifically, we propose the first data augmentation-based membership inference attacks against ML models trained by SSL. Given a data sample and the black-box access to a model, the goal of membership inference attack is to determine whether the data sample belongs to the training dataset of the model. Our evaluation shows that the proposed attack can consistently outperform existing membership inference attacks and achieves the best performance against the model trained by SSL. Moreover, we uncover that the reason for membership leakage in SSL is different from the commonly believed one in supervised learning, i.e., overfitting (the gap between training and testing accuracy). We observe that the SSL model is well generalized to the testing data (with almost 0 overfitting) but ''memorizes'' the training data by giving a more confident prediction regardless of its correctness. We also explore early stopping as a countermeasure to prevent membership inference attacks against SSL. The results show that early stopping can mitigate the membership inference attack, but with the cost of model's utility degradation.

翻译：由半监督监督的学习(SSL)利用标签和未标记的数据来培训机器学习模式。先进的SSL 方法可以通过利用少得多的标签数据实现与受监督的学习的可比业绩。然而,大多数现有工作侧重于改进SSL的绩效。在这项工作中,我们从不同的角度研究SSL的培训数据隐私。具体地说,我们提议对SSL培训的ML模型进行首次基于数据增强的成员推断攻击。鉴于数据抽样和黑盒访问模型,成员推断攻击的目标是确定数据样本是否属于模型的培训数据集。我们的评估表明,拟议的攻击可以持续超过现有的成员推断攻击,并比SSL培训模型所培训的模型取得最佳业绩。此外,我们发现,SLSL成员流失的原因不同于通常相信的监督学习,即过度(培训和测试准确性之间的差距),因此SLSL模型与测试数据的测试数据非常普遍化(近0的效用样本属于该模型的培训数据集,而我们则通过早期的稳定性来降低攻击的稳定性。

0

相关内容

SSL

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

34+阅读 · 2022年3月5日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

人工智能如何用于抵抗COVID-19？Mila这份《AI against COVID-19 》PPT

专知会员服务

48+阅读 · 2020年5月17日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

有丝分裂中Cdc20与相关调控蛋白的复合体结构与功能研究

国家自然科学基金

0+阅读 · 2014年12月31日

新型HER2抗体TPC对HER2阳性Trastuzumab耐受型乳腺癌的杀伤作用及分子机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

RNA解旋酶Prp5的翻译后修饰及其调控剪接机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

蛋白激酶LIMK1活性在小鼠卵母细胞染色体分离过程中的作用和分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

脂肪乳调控PI3K/Akt/GSK-3β信号通路逆转布比卡因心脏毒性的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

Co基磁性Heusler合金相关体系相图与化合物的结构与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Hedgehog信号通路调控宫颈癌上皮间质转化的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

赋值理论与几何不等式的研究

国家自然科学基金

1+阅读 · 2011年12月31日

松材线虫伴生细菌的分离鉴定及与宿主互作的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

BMP对猪肌内脂肪前体细胞分化聚酯的调控作用及信号通路

国家自然科学基金

0+阅读 · 2009年12月31日

AdvDO: Realistic Adversarial Attacks for Trajectory Prediction

Arxiv

0+阅读 · 2022年9月19日

Model Inversion Attacks against Graph Neural Networks

Arxiv

0+阅读 · 2022年9月19日

Membership Inference Attacks and Generalization: A Causal Perspective

Arxiv

0+阅读 · 2022年9月18日

UnSplit: Data-Oblivious Model Inversion, Model Stealing, and Label Inference Attacks Against Split Learning

Arxiv

0+阅读 · 2022年9月16日

CLIPping Privacy: Identity Inference Attacks on Multi-Modal Machine Learning Models

Arxiv

0+阅读 · 2022年9月15日

Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training

Arxiv

0+阅读 · 2022年9月15日

M^4I: Multi-modal Models Membership Inference

Arxiv

0+阅读 · 2022年9月15日

Updating Embeddings for Dynamic Knowledge Graphs

Arxiv

20+阅读 · 2021年9月22日

Data Augmentation for Graph Neural Networks

Arxiv

38+阅读 · 2020年12月2日

Deep Representation Learning for Domain Adaptation of Semantic Image Segmentation

Arxiv

10+阅读 · 2018年5月10日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

33页PPT【AI+天气预测】，AI and Machine learning for weather predictions

专知会员服务

34+阅读 · 2022年3月5日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

人工智能如何用于抵抗COVID-19？Mila这份《AI against COVID-19 》PPT

专知会员服务

48+阅读 · 2020年5月17日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《战区安全决策课程体系》最新244页

《"无人机航母"原型平台》

任务规划与地形分析：现代复杂环境作战导航体系

《攻击场景描述形式化模型研究》

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

【论文推荐】最新5篇度量学习（Metric Learning）相关论文—人脸验证、BIER、自适应图卷积、注意力机制、单次学习

专知

17+阅读 · 2018年2月11日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

相关论文

AdvDO: Realistic Adversarial Attacks for Trajectory Prediction

Arxiv

0+阅读 · 2022年9月19日

Model Inversion Attacks against Graph Neural Networks

Arxiv

0+阅读 · 2022年9月19日

Membership Inference Attacks and Generalization: A Causal Perspective

Arxiv

0+阅读 · 2022年9月18日

UnSplit: Data-Oblivious Model Inversion, Model Stealing, and Label Inference Attacks Against Split Learning

Arxiv

0+阅读 · 2022年9月16日

CLIPping Privacy: Identity Inference Attacks on Multi-Modal Machine Learning Models

Arxiv

0+阅读 · 2022年9月15日

Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training

Arxiv

0+阅读 · 2022年9月15日

M^4I: Multi-modal Models Membership Inference

Arxiv

0+阅读 · 2022年9月15日

Updating Embeddings for Dynamic Knowledge Graphs

Arxiv

20+阅读 · 2021年9月22日

Data Augmentation for Graph Neural Networks

Arxiv

38+阅读 · 2020年12月2日

Deep Representation Learning for Domain Adaptation of Semantic Image Segmentation

Arxiv

10+阅读 · 2018年5月10日

相关基金

有丝分裂中Cdc20与相关调控蛋白的复合体结构与功能研究

国家自然科学基金

0+阅读 · 2014年12月31日

新型HER2抗体TPC对HER2阳性Trastuzumab耐受型乳腺癌的杀伤作用及分子机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

RNA解旋酶Prp5的翻译后修饰及其调控剪接机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

蛋白激酶LIMK1活性在小鼠卵母细胞染色体分离过程中的作用和分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

脂肪乳调控PI3K/Akt/GSK-3β信号通路逆转布比卡因心脏毒性的作用机制

国家自然科学基金

0+阅读 · 2012年12月31日

Co基磁性Heusler合金相关体系相图与化合物的结构与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

Hedgehog信号通路调控宫颈癌上皮间质转化的作用及机制

国家自然科学基金

0+阅读 · 2011年12月31日

赋值理论与几何不等式的研究

国家自然科学基金

1+阅读 · 2011年12月31日

松材线虫伴生细菌的分离鉴定及与宿主互作的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

BMP对猪肌内脂肪前体细胞分化聚酯的调控作用及信号通路

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员