Deep learning achieves outstanding results in many machine learning tasks. Nevertheless, it is vulnerable to backdoor attacks that modify the training set to embed a secret functionality in the trained model. The modified training samples carry a secret property, i.e., a trigger. At inference time, the secret functionality is activated when the input contains the trigger, while the model behaves correctly otherwise. Although many backdoor attacks (and defenses) are known, deploying a stealthy attack is still far from trivial. Successfully creating backdoor triggers depends on numerous parameters, and, unfortunately, research has not yet determined which parameters contribute most to the attack performance. This paper systematically analyzes the most relevant parameters for backdoor attacks, i.e., trigger size, position, color, and poisoning rate. Using transfer learning, which is very common in computer vision, we evaluate the attack on state-of-the-art models (ResNet, VGG, AlexNet, and GoogLeNet) and datasets (MNIST, CIFAR10, and TinyImageNet). Our attacks cover the majority of backdoor settings found in the literature, providing concrete directions for future work. Our code is publicly available to facilitate the reproducibility of our results.
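To make the studied parameters concrete, the following is a minimal sketch of patch-trigger data poisoning, assuming images stored as HxWx3 uint8 arrays and integer labels; the function name, defaults, and values are illustrative and do not reflect the paper's actual implementation.

```python
# Minimal sketch of patch-trigger poisoning (illustrative; not the paper's code).
import numpy as np

def poison_dataset(images, labels, target_class, poisoning_rate=0.05,
                   trigger_size=4, position=(0, 0), color=(255, 255, 255),
                   seed=0):
    """Stamp a square trigger of the given size/position/color onto a random
    fraction (poisoning_rate) of the training images and relabel those
    samples to the attacker's target class."""
    rng = np.random.default_rng(seed)
    images, labels = images.copy(), labels.copy()
    n_poison = int(len(images) * poisoning_rate)
    idx = rng.choice(len(images), size=n_poison, replace=False)
    y, x = position
    for i in idx:
        images[i, y:y + trigger_size, x:x + trigger_size] = color  # paste trigger
        labels[i] = target_class  # backdoor mapping: trigger -> target class
    return images, labels, idx

# Example: poison 5% of a toy dataset with a 4x4 white patch in the top-left corner.
imgs = np.zeros((100, 32, 32, 3), dtype=np.uint8)
lbls = np.random.randint(0, 10, size=100)
p_imgs, p_lbls, p_idx = poison_dataset(imgs, lbls, target_class=7)
```

In this sketch, `trigger_size`, `position`, `color`, and `poisoning_rate` correspond directly to the four parameters whose influence on attack performance the paper analyzes.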