Due to the gap between a substitute model and a victim model, the gradient-based noise generated from a substitute model may have low transferability for a victim model since their gradients are different. Inspired by the fact that the decision boundaries of different models do not differ much, we conduct experiments and discover that the gradients of different models are more similar on the decision boundary than in the original position. Moreover, since the decision boundary in the vicinity of an input image is flat along most directions, we conjecture that the boundary gradients can help find an effective direction to cross the decision boundary of the victim models. Based on it, we propose a Boundary Fitting Attack to improve transferability. Specifically, we introduce a method to obtain a set of boundary points and leverage the gradient information of these points to update the adversarial examples. Notably, our method can be combined with existing gradient-based methods. Extensive experiments prove the effectiveness of our method, i.e., improving the success rate by 5.6% against normally trained CNNs and 14.9% against defense CNNs on average compared to state-of-the-art transfer-based attacks. Further we compare transformers with CNNs, the results indicate that transformers are more robust than CNNs. However, our method still outperforms existing methods when attacking transformers. Specifically, when using CNNs as substitute models, our method obtains an average attack success rate of 58.2%, which is 10.8% higher than other state-of-the-art transfer-based attacks.
翻译:由于替代模型和受害者模型之间存在差距,替代模型产生的梯度噪音可能由于受害者模型的梯度差异而使受害者模型的可转移性较低,因为其梯度不同。受不同模型的决定界限没有多大差异这一事实的启发,我们进行实验并发现,不同模型的梯度在决定边界上比最初位置更为相似。此外,由于输入图像附近的决定边界与大多数方向相距不远,我们推测,由于替代模型产生的梯度可以帮助找到有效的方向,跨越受害者模型的决定边界。在此基础上,我们建议建立一个适合攻击的边界,以改善可转移性。具体地说,我们采用一种方法获取一套边界点并利用这些点的梯度信息来更新对抗实例。值得注意的是,我们的方法可以与现有的梯度方法相结合。由于我们的方法的有效性,即,相对于通常经过培训的CNN的CNN,提高5.6%的成功率,而相对防御CNN的平均速度而言,我们建议采用一个更适合基于状态的转移模式。我们将58级袭击的变压器与现在的CNN方法相比,当我们采用更稳健的CNN的变压方法时,结果会显示我们的现有变压方法比现在的CNN的方法。</s>