A recent line of work has shown that deep networks are highly susceptible to backdoor data poisoning attacks. Specifically, by injecting a small amount of malicious data into the training distribution, an adversary gains the ability to control the model's behavior during inference. In this work, we propose an iterative training procedure for removing poisoned data from the training set. Our approach consists of two steps. We first train an ensemble of weak learners to automatically discover distinct subpopulations in the training set. We then leverage a boosting framework to recover the clean data. Empirically, our method successfully defends against several state-of-the-art backdoor attacks, including both clean and dirty label attacks. We also present results from an independent third-party evaluation including a recent \textit{adaptive} poisoning adversary. The results indicate our approach is competitive with existing defenses against backdoor attacks on deep neural networks, and significantly outperforms the state-of-the-art in several scenarios.
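To make the two-step procedure concrete, the following is a minimal sketch of an ensemble-based filtering step, not the authors' exact method: several weak learners are fit on held-out splits of the (possibly poisoned) training set, and each example is scored by how often the learners reproduce its label; low-agreement examples are dropped as suspected poison. The function name, thresholds, and choice of logistic-regression weak learners are illustrative assumptions only.

\begin{verbatim}
# Hypothetical sketch of an ensemble/boosting-style poison filter.
# Assumptions: feature matrix X, labels y, sklearn weak learners.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold

def ensemble_filter(X, y, rounds=5, n_folds=3, keep_ratio=0.6, seed=0):
    """Score each example by how often held-out weak learners agree
    with its given label, then keep the highest-scoring fraction."""
    votes = np.zeros(len(y), dtype=float)
    for r in range(rounds):
        skf = StratifiedKFold(n_splits=n_folds, shuffle=True,
                              random_state=seed + r)
        for fit_idx, held_idx in skf.split(X, y):
            # Weak learner trained on one split, evaluated on the held-out part.
            weak = LogisticRegression(max_iter=500).fit(X[fit_idx], y[fit_idx])
            votes[held_idx] += weak.predict(X[held_idx]) == y[held_idx]
    votes /= rounds
    # Keep the examples the ensemble most consistently agrees on.
    cutoff = np.quantile(votes, 1.0 - keep_ratio)
    return votes >= cutoff  # boolean mask marking (likely) clean examples

# Usage: mask = ensemble_filter(X, y); X_clean, y_clean = X[mask], y[mask]
\end{verbatim}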