Transfer-based adversarial example is one of the most important classes of black-box attacks. However, there is a trade-off between transferability and imperceptibility of the adversarial perturbation. Prior work in this direction often requires a fixed but large $\ell_p$-norm perturbation budget to reach a good transfer success rate, leading to perceptible adversarial perturbations. On the other hand, most of the current unrestricted adversarial attacks that aim to generate semantic-preserving perturbations suffer from weaker transferability to the target model. In this work, we propose a geometry-aware framework to generate transferable adversarial examples with minimum changes. Analogous to model selection in statistical machine learning, we leverage a validation model to select the best perturbation budget for each image under both the $\ell_{\infty}$-norm and unrestricted threat models. We propose a principled method for the partition of training and validation models by encouraging intra-group diversity while penalizing extra-group similarity. Extensive experiments verify the effectiveness of our framework on balancing imperceptibility and transferability of the crafted adversarial examples. The methodology is the foundation of our entry to the CVPR'21 Security AI Challenger: Unrestricted Adversarial Attacks on ImageNet, in which we ranked 1st place out of 1,559 teams and surpassed the runner-up submissions by 4.59% and 23.91% in terms of final score and average image quality level, respectively. Code is available at https://github.com/Equationliu/GA-Attack.
翻译:基于转移的敌对性实例是最重要的黑箱攻击类别之一。 但是,在对抗性扰动的可转移性和不可理解性之间存在着一种权衡。 先前朝此方向开展的工作往往需要固定但大为$@ p$- norm 扰动预算,以达到良好的转移成功率,导致明显的对抗性扰动。 另一方面,目前大多数旨在产生语义保留扰动的无限制的敌对性攻击都因向目标模式转移能力较弱而受到影响。 在这项工作中,我们提议了一个几何觉知性框架,以生成可转移的对抗性例子,但变化最小。 在统计性机器学习中,对模型的选择进行分析,我们利用一个验证模式,在$\ell ⁇ infty}$-norm 和不受限制的威胁模式下,为每种图像选择最佳的扰动性预算。 我们提出一个原则性方法,通过鼓励群体内部多样性来分配培训和验证模式,同时对集团内现有平均差异进行惩罚。 广泛实验核实我们在平衡不易理解性提交/ 网络- 格式的可转移性条款的有效性。 在1 AL- refortial vical vial vility asion subility 基础中, view view view view vilate view view view view view view view view viewm viewm view view view viewm