争取在Black-Box攻击中的视觉扭曲 (Towards Visual Distortion in Black-Box Attacks)

Constructing adversarial examples in a black-box threat model injures the original images by introducing visual distortion. In this paper, we propose a novel black-box attack approach that can directly minimize the induced distortion by learning the noise distribution of the adversarial example, assuming only loss-oracle access to the black-box network. The quantified visual distortion, which measures the perceptual distance between the adversarial example and the original image, is introduced in our loss whilst the gradient of the corresponding non-differentiable loss function is approximated by sampling noise from the learned noise distribution. We validate the effectiveness of our attack on ImageNet. Our attack results in much lower distortion when compared to the state-of-the-art black-box attacks and achieves $100\%$ success rate on InceptionV3, ResNet50 and VGG16bn. The code is available at https://github.com/Alina-1997/visual-distortion-in-attack.

翻译：在黑盒威胁模型中构建对抗性实例会通过引入视觉扭曲而损害原始图像。在本文中,我们建议采用新的黑盒攻击方法,通过了解对抗性例子的噪音分布,可以直接将诱发扭曲降到最低程度,假设只能使用黑盒网络的损耗-oracle。量化的视觉扭曲测量了对抗性例子与原始图像之间的概念距离,在我们的损失中引入了量化的视觉扭曲,而相应的不可区分的损失功能的梯度则通过从所学的噪音分布中取样的噪音来近似。我们验证了我们对图像网络的攻击的有效性。与最先进的黑盒攻击相比,我们的攻击导致的扭曲程度要低得多, InpeptionionV3、ResNet50和VGG16bn的成功率要达到100美元。该代码可在https://github.com/Alina-97/vivical-dortricon-in-in-freat中查阅。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

【AAAI2021】属性引导对抗训练的自然扰动鲁棒性

专知会员服务

26+阅读 · 2021年1月21日

【KDD2020-Tutorial】对抗性的攻击和防御:前沿、进展与实践，171页ppt

专知会员服务

80+阅读 · 2020年8月24日

【Google】平滑对抗训练，Smooth Adversarial Training

专知会员服务

49+阅读 · 2020年7月4日

【KDD2020】现实世界超图的结构模式和生成模型，Structural Patterns and Generative Models of Real-world Hypergraphs

专知会员服务

37+阅读 · 2020年6月16日