The transferability and robustness of adversarial examples are two practical yet important properties for black-box adversarial attacks. In this paper, we explore effective mechanisms to boost both of them from the perspective of network hierarchy, where a typical network can be hierarchically divided into output stage, intermediate stage and input stage. Since over-specialization of source model, we can hardly improve the transferability and robustness of the adversarial perturbations in the output stage. Therefore, we focus on the intermediate and input stages in this paper and propose a transferable and robust adversarial perturbation generation (TRAP) method. Specifically, we propose the dynamically guided mechanism to continuously calculate accurate directional guidances for perturbation generation in the intermediate stage. In the input stage, instead of the single-form transformation augmentations adopted in the existing methods, we leverage multiform affine transformation augmentations to further enrich the input diversity and boost the robustness and transferability of the adversarial perturbations. Extensive experiments demonstrate that our TRAP achieves impressive transferability and high robustness against certain interferences.
翻译:对抗性实例的可转移性和稳健性是黑盒对抗性攻击的两个实际但重要的特性。在本文中,我们探索了从网络等级结构的角度提升两者的有效机制,其中典型的网络可以按等级划分为产出阶段、中间阶段和输入阶段。由于源模式的过度专业化,我们很难提高产出阶段对立性扰动的可转移性和稳健性。因此,我们注重本文件的中间和输入阶段,并提出一种可转移和稳健的对抗性扰动生成(TRAP)方法。具体地说,我们提出了动态指导机制,以不断计算中间阶段扰动性生成的准确方向指导。在输入阶段,而不是现有方法采用的单一形式变形增强,我们利用多种形式成形的变形增强,以进一步丰富投入的多样性,提高对抗性扰动性扰动的稳健性和可转移性。广泛的实验表明,我们的TRAP在防止某些干扰方面实现了令人印象深刻的可转移性和高度稳健性。