感光攻击:向隐隐形黑盒反反向攻击 (Saliency Attack: Towards Imperceptible Black-box Adversarial Attack)

Deep neural networks are vulnerable to adversarial examples, even in the black-box setting where the attacker is only accessible to the model output. Recent studies have devised effective black-box attacks with high query efficiency. However, such performance is often accompanied by compromises in attack imperceptibility, hindering the practical use of these approaches. In this paper, we propose to restrict the perturbations to a small salient region to generate adversarial examples that can hardly be perceived. This approach is readily compatible with many existing black-box attacks and can significantly improve their imperceptibility with little degradation in attack success rate. Further, we propose the Saliency Attack, a new black-box attack aiming to refine the perturbations in the salient region to achieve even better imperceptibility. Extensive experiments show that compared to the state-of-the-art black-box attacks, our approach achieves much better imperceptibility scores, including most apparent distortion (MAD), $L_0$ and $L_2$ distances, and also obtains significantly higher success rates judged by a human-like threshold on MAD. Importantly, the perturbations generated by our approach are interpretable to some extent. Finally, it is also demonstrated to be robust to different detection-based defenses.

翻译：纵深神经网络很容易受到对抗性实例的影响,即使在攻击者只能进入模型输出的黑箱环境中,攻击者也很容易受到对抗性实例的影响。最近的研究已经设计出有效的黑箱攻击,其查询效率很高。然而,这种表现往往伴随着攻击不易感知的妥协,妨碍了这些方法的实际使用。在本文中,我们建议将扰动限制在一个小的显要区域,以产生几乎无法察觉的敌对性实例。这一方法很容易与许多现有的黑箱攻击相容,并且可以大大提高其不易感知性,在攻击成功率中也很少退化。此外,我们提议进行 " 敬重攻击 ",这是一次新的黑箱攻击,目的是改进突出区域的扰动性,以达到更难受性。广泛的实验表明,与最先进的黑箱攻击相比,我们的方法取得了更难辨识性的分数,包括最明显的扭曲(MAD)、 $L_0和$L_2美元的距离,并且获得某种类似人类的临界值所判断的成功率要高得多。最终,通过对MAD的防御性的方法进行了不同的解释。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

近期必读的六篇AAAI 2021【对抗攻击（Adversarial Attack）】相关论文和代码

专知会员服务

55+阅读 · 2021年2月17日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日