Meta 梯度反逆攻击 (Meta Gradient Adversarial Attack)

In recent years, research on adversarial attacks has become a hot spot. Although current literature on the transfer-based adversarial attack has achieved promising results for improving the transferability to unseen black-box models, it still leaves a long way to go. Inspired by the idea of meta-learning, this paper proposes a novel architecture called Meta Gradient Adversarial Attack (MGAA), which is plug-and-play and can be integrated with any existing gradient-based attack method for improving the cross-model transferability. Specifically, we randomly sample multiple models from a model zoo to compose different tasks and iteratively simulate a white-box attack and a black-box attack in each task. By narrowing the gap between the gradient directions in white-box and black-box attacks, the transferability of adversarial examples on the black-box setting can be improved. Extensive experiments on the CIFAR10 and ImageNet datasets show that our architecture outperforms the state-of-the-art methods for both black-box and white-box attack settings.

翻译：近些年来,关于对抗性攻击的研究已成为热点。尽管目前关于以转移为基础的对抗性攻击的文献在改进向隐蔽黑盒模式的可转移性方面取得了可喜的成果,但仍任重道远。在元学习理念的启发下,本文提出了名为Meta Gradient Aversarial Attack(MGAAA)的新结构,这是一个插座和游戏,可以与现有的任何基于梯度的攻击方法相结合,以改进跨模范转移性。具体地说,我们随机抽样从一个模型动物园抽取多个模型,以组成不同的任务,并反复模拟白盒攻击和黑盒攻击。通过缩小白盒攻击中的梯度方向和黑盒攻击之间的鸿沟,黑盒设置上的对抗性例子的可转移性可以改进。关于CIFAR10和图像网络数据集的广泛实验显示,我们的架构超越了黑盒和白盒攻击环境的最先进方法。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

近期必读的六篇AAAI 2021【对抗攻击（Adversarial Attack）】相关论文和代码

专知会员服务

55+阅读 · 2021年2月17日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日