DNNs' demand for massive training data forces practitioners to collect data from the Internet without careful vetting, since manual inspection at that scale is prohibitively expensive; this exposes models to the risk of backdoor attacks. A backdoored model consistently predicts an attacker-chosen target class whenever a predefined trigger pattern is present, and such behavior can be implanted simply by poisoning a small fraction of the training data. In general, adversarial training is believed to defend against backdoor attacks, since it encourages models to keep their predictions unchanged even when the input image is perturbed within a feasible range. Unfortunately, few previous studies have succeeded in demonstrating such a defense. To explore whether adversarial training can defend against backdoor attacks, we conduct extensive experiments across different threat models and perturbation budgets, and find that the threat model used in adversarial training matters. For instance, adversarial training with spatial adversarial examples provides notable robustness against commonly used patch-based backdoor attacks. We further propose a hybrid strategy that provides satisfactory robustness across different backdoor attacks.
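For concreteness, below is a minimal PyTorch-style sketch of the standard adversarial training loop the abstract alludes to: an inner maximization that finds a worst-case perturbation within a feasible budget, followed by an outer minimization on the perturbed inputs. This is the common PGD formulation under an l_inf constraint, not necessarily the paper's exact procedure; the spatial threat model mentioned above would replace the additive perturbation with bounded spatial transformations. The values of eps, alpha, and steps are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Inner maximization: PGD within an l_inf ball of radius eps.
    (Other threat models, e.g. spatial transformations, would swap out
    this additive-perturbation search; hyperparameters are illustrative.)"""
    delta = torch.zeros_like(x).uniform_(-eps, eps).requires_grad_(True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad, = torch.autograd.grad(loss, delta)
        # Ascend the loss, then project back into the l_inf ball.
        delta = (delta + alpha * grad.sign()).clamp(-eps, eps)
        delta = delta.detach().requires_grad_(True)
    # Keep the perturbed image in the valid pixel range [0, 1].
    return (x + delta).clamp(0, 1).detach()

def adversarial_training_step(model, optimizer, x, y):
    """Outer minimization: one training step on worst-case inputs."""
    x_adv = pgd_attack(model, x, y)
    optimizer.zero_grad()
    loss = F.cross_entropy(model(x_adv), y)
    loss.backward()
    optimizer.step()
    return loss.item()
```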