Adversarial Unlearning of Backdoors via Implicit Hypergradient - 专知 Papers


January 22, 2022

Adversarial Unlearning of Backdoors via Implicit Hypergradient


Yi Zeng, Si Chen, Won Park, Z. Morley Mao, Ming Jin, Ruoxi Jia

from arxiv, 9 pages main text, 3 pages references, 12 pages appendix, 5 figures

We propose a minimax formulation for removing backdoors from a given poisoned model based on a small set of clean data. This formulation encompasses much of the prior work on backdoor removal. We propose the Implicit Backdoor Adversarial Unlearning (I-BAU) algorithm to solve the minimax problem. Unlike previous work, which breaks the minimax down into separate inner and outer problems, our algorithm uses the implicit hypergradient to account for the interdependence between the inner and outer optimization. We theoretically analyze its convergence and the generalizability of the robustness gained by solving the minimax on clean data to unseen test data. In our evaluation, we compare I-BAU with six state-of-the-art backdoor defenses on seven backdoor attacks over two datasets and various attack settings, including the common setting where the attacker targets one class as well as important but underexplored settings where multiple classes are targeted. I-BAU's performance is comparable to, and most often significantly better than, the best baseline. In particular, its performance is more robust to variations in triggers, attack settings, poison ratio, and clean data size. Moreover, I-BAU requires less computation to take effect; it is more than $13\times$ faster than the most efficient baseline in the single-target attack setting. Furthermore, it remains effective in the extreme case where the defender can access only 100 clean samples -- a setting where all the baselines fail to produce acceptable results.
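To make the idea of an implicit hypergradient concrete, here is a minimal sketch on a toy problem. It is not the I-BAU algorithm: the abstract does not spell out the objective, so everything below is an assumption made for illustration. The toy objective `H`, the matrix `A`, the vector `c`, and the penalty weight `lam` are all hypothetical, and the norm bound on the perturbation is replaced by an L2 penalty so that the inner maximizer has a closed form. The sketch compares the gradient that treats the inner solution as a constant with the one that adds the response term from the implicit function theorem.

```python
import numpy as np

# Hypothetical toy setup (not from the paper).
A = np.array([[1.0, 0.5], [0.0, 2.0]])  # couples perturbation delta and parameters theta
c = np.array([1.0, -1.0])               # minimizer of the uncoupled outer loss
lam = 2.0                               # L2 penalty weight standing in for a norm bound


def H(delta, theta):
    """Toy outer objective: a quadratic loss plus a bilinear coupling term."""
    return 0.5 * np.sum((theta - c) ** 2) + delta @ A @ theta


def inner_solution(theta):
    """Maximizer of H(delta, theta) - lam/2 * ||delta||^2 over delta.

    Stationarity gives A @ theta - lam * delta = 0.
    """
    return A @ theta / lam


def psi(theta):
    """Outer objective evaluated at the inner maximizer."""
    return H(inner_solution(theta), theta)


def implicit_hypergradient(theta):
    """Return (full hypergradient, direct-only gradient) of psi at theta."""
    delta = inner_solution(theta)
    grad_theta = (theta - c) + A.T @ delta  # partial of H w.r.t. theta
    grad_delta = A @ theta                  # partial of H w.r.t. delta
    hess_dd = -lam * np.eye(len(delta))     # second derivative of inner objective in delta
    hess_dt = A                             # mixed derivative of inner objective
    # Implicit function theorem: d delta*/d theta = -hess_dd^{-1} @ hess_dt.
    # Solve a linear system instead of forming the inverse explicitly.
    v = np.linalg.solve(hess_dd.T, grad_delta)
    indirect = -hess_dt.T @ v
    return grad_theta + indirect, grad_theta


def finite_difference(f, theta, eps=1e-6):
    """Central-difference gradient, used only as a reference check."""
    grad = np.zeros_like(theta)
    for i in range(len(theta)):
        step = np.zeros_like(theta)
        step[i] = eps
        grad[i] = (f(theta + step) - f(theta - step)) / (2 * eps)
    return grad


if __name__ == "__main__":
    theta = np.array([0.3, -0.7])
    full, direct = implicit_hypergradient(theta)
    print("implicit hypergradient:", full)
    print("direct-only gradient:  ", direct)
    print("finite difference:     ", finite_difference(psi, theta))
```

In this toy, the direct-only gradient misses the term that describes how the inner solution moves when `theta` changes, while the full hypergradient agrees with the finite-difference reference. In a real network the Hessian would not be formed explicitly; an iterative solver over Hessian-vector products would typically take the place of `np.linalg.solve`.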









