DECK: 防御渗透式后门的示范硬件 (DECK: Model Hardening for Defending Pervasive Backdoors)

Pervasive backdoors are triggered by dynamic and pervasive input perturbations. They can be intentionally injected by attackers or naturally exist in normally trained models. They have a different nature from the traditional static and localized backdoors that can be triggered by perturbing a small input area with some fixed pattern, e.g., a patch with solid color. Existing defense techniques are highly effective for traditional backdoors. However, they may not work well for pervasive backdoors, especially regarding backdoor removal and model hardening. In this paper, we propose a novel model hardening technique against pervasive backdoors, including both natural and injected backdoors. We develop a general pervasive attack based on an encoder-decoder architecture enhanced with a special transformation layer. The attack can model a wide range of existing pervasive backdoor attacks and quantify them by class distances. As such, using the samples derived from our attack in adversarial training can harden a model against these backdoor vulnerabilities. Our evaluation on 9 datasets with 15 model structures shows that our technique can enlarge class distances by 59.65% on average with less than 1% accuracy degradation and no robustness loss, outperforming five hardening techniques such as adversarial training, universal adversarial training, MOTH, etc. It can reduce the attack success rate of six pervasive backdoor attacks from 99.06% to 1.94%, surpassing seven state-of-the-art backdoor removal techniques.

翻译：渗透式后门是动态和普遍输入干扰引发的。攻击者可以故意地注射, 或者自然地存在于通常经过训练的模型中。它们的性质不同于传统的静态和局部的后门, 以某些固定模式, 例如固色的修饰方式, 扰动一个小输入区, 从而触发了固定的后门。现有的防御技术对传统的后门非常有效。但是, 它们对于普遍的后门可能不太有效, 特别是在后门清除和模型变硬方面。在本文中, 我们提议了一种新型的强化技术, 对付普遍的后门, 包括自然和注射的后门。我们开发了一种基于以特殊变异层强化的编码器- 解码器结构的普遍攻击。攻击可以模拟一系列现有的普遍的后门攻击, 并且用阶级距离来量化这些攻击。因此, 使用从我们攻击后门后门的样本, 能够使一个模式中的这些弱点更硬的模型。我们对有15个模式结构的9个数据集的评估显示, 我们的技术可以将教室的距离平均扩大59. 65 %, 而不是精确性攻击的精确度, 摩托式攻击率。 1.94 和没有激烈的反向后门级训练率。成功率。成功率的技巧可以减少六级的技巧, 击退的技巧可以使这种技术,, 的技巧可以使这种技术从一个精确性攻击的的的的的的的的的的的的的的的的的的的的的的性攻击率性攻击率性攻击率的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的的

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

剑桥大学《数据科学: 原理与实践》课程，附PPT下载

专知会员服务

53+阅读 · 2021年1月20日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日