When an adversary injects poison samples into a machine learning model's training data, privacy leakage, such as membership inference attacks that infer whether a sample was included in the model's training, becomes more effective because poisoning pushes the target sample toward being an outlier. However, such attacks can be detected because the poison samples cause the model's inference accuracy to deteriorate. In this paper, we discuss a \textit{backdoor-assisted membership inference attack}, a novel membership inference attack that leverages backdoors, which return the adversary's intended output for any triggered sample. Through experiments on an academic benchmark dataset, we obtain three crucial insights. First, we demonstrate that the backdoor-assisted membership inference attack is unsuccessful. Second, when we analyze the loss distributions to understand why it fails, we find that backdoors cannot separate the loss distributions of training and non-training samples; in other words, backdoors do not affect the distribution of clean samples. Third, we show that poison samples and triggered samples yield different distributions of neuron activations. Specifically, backdoors leave every clean sample an inlier, in contrast to poison samples, which become outliers. As a result, we confirm that backdoors cannot assist membership inference.
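As a point of reference for the loss-based analysis, membership inference is commonly formalized as a threshold test on the per-sample loss; the notation below is a minimal sketch introduced for illustration and is not fixed by this abstract. Given a target model $f_\theta$, a per-sample loss $\ell$ (e.g., cross-entropy), and a threshold $\tau$ calibrated on non-member losses, the adversary predicts membership of a labeled sample $(x, y)$ as
\[
% illustrative notation: f_\theta, \ell, and \tau are not defined in the original abstract
\mathcal{A}(x, y) = \mathbf{1}\left[\, \ell\big(f_\theta(x), y\big) < \tau \,\right],
\]
i.e., a sample with unusually small loss is inferred to be a training member. Such an attack can succeed only when the loss distributions of training and non-training samples are separable, which is exactly the separation that backdoor triggers fail to induce in our experiments.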