利用关注关系图蒸馏消除深神经网络的后门触发器 (Eliminating Backdoor Triggers for Deep Neural Networks Using Attention Relation Graph Distillation)

Due to the prosperity of Artificial Intelligence (AI) techniques, more and more backdoors are designed by adversaries to attack Deep Neural Networks (DNNs).Although the state-of-the-art method Neural Attention Distillation (NAD) can effectively erase backdoor triggers from DNNs, it still suffers from non-negligible Attack Success Rate (ASR) together with lowered classification ACCuracy (ACC), since NAD focuses on backdoor defense using attention features (i.e., attention maps) of the same order. In this paper, we introduce a novel backdoor defense framework named Attention Relation Graph Distillation (ARGD), which fully explores the correlation among attention features with different orders using our proposed Attention Relation Graphs (ARGs). Based on the alignment of ARGs between both teacher and student models during knowledge distillation, ARGD can eradicate more backdoor triggers than NAD. Comprehensive experimental results show that, against six latest backdoor attacks, ARGD outperforms NAD by up to 94.85% reduction in ASR, while ACC can be improved by up to 3.23%.

翻译：由于人工智能技术的繁荣,对手设计了越来越多的后门来攻击深神经网络。虽然最先进的神经注意力蒸馏法(NAD)能够有效地消除DNN的后门触发器,但它仍然受到不可忽略的攻击成功率(ASR)和降低的ACCUracy(ACC)的影响,因为NAD利用同一顺序的注意特征(即注意地图)专注于后门防御。在本文中,我们引入了一个名为“注意反应图蒸馏”(ARGD)的新颖的后门防御框架,它充分探讨注意特征与不同订单之间的关系,同时利用我们拟议的注意反应图(ARGs) 。根据教师和学生模型在知识蒸馏过程中的一致,ARGD可以消除比NAD更多的后门触发器。全面实验结果表明,针对最近的六次后门攻击(即注意地图),ARGD将NAD排出NAD, 减到ASR的94.85%,而ACC则可以改进到3.23%。

相关内容

注意力机制

关注 120

Attention机制最早是在视觉图像领域提出来的，但是真正火起来应该算是google mind团队的这篇论文《Recurrent Models of Visual Attention》[14]，他们在RNN模型上使用了attention机制来进行图像分类。随后，Bahdanau等人在论文《Neural Machine Translation by Jointly Learning to Align and Translate》 [1]中，使用类似attention的机制在机器翻译任务上将翻译和对齐同时进行，他们的工作算是是第一个提出attention机制应用到NLP领域中。接着类似的基于attention机制的RNN模型扩展开始应用到各种NLP任务中。最近，如何在CNN中使用attention机制也成为了大家的研究热点。下图表示了attention研究进展的大概趋势。

【MIT Sam Hopkins】如何读论文？How to Read a Paper

专知会员服务

108+阅读 · 2022年3月20日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日