Due to the vulnerability of deep neural networks, the black-box attack has drawn great attention from the community. Though transferable priors decrease the query number of the black-box query attacks in recent efforts, the average number of queries is still larger than 100, which is easily affected by the number of queries limit policy. In this work, we propose a novel method called query prior-based method to enhance the family of fast gradient sign methods and improve their attack transferability by using a few queries. Specifically, for the untargeted attack, we find that the successful attacked adversarial examples prefer to be classified as the wrong categories with higher probability by the victim model. Therefore, the weighted augmented cross-entropy loss is proposed to reduce the gradient angle between the surrogate model and the victim model for enhancing the transferability of the adversarial examples. Theoretical analysis and extensive experiments demonstrate that our method could significantly improve the transferability of gradient-based adversarial attacks on CIFAR10/100 and ImageNet and outperform the black-box query attack with the same few queries.
翻译:由于深层神经网络的脆弱性,黑盒攻击引起了社区的极大关注。虽然可转移的前身在最近的努力中减少了黑盒查询攻击的查询次数,但平均查询次数仍然超过100次,这很容易受到查询限制政策数目的影响。在这项工作中,我们提议了一种新颖的方法,称为先发询问方法,用几个查询来增强快速梯度标志方法的家庭,并改善其攻击性。具体地说,对非目标攻击而言,我们发现成功攻击的对抗性例子更倾向于被受害者模型列为错误类别,其概率较高。因此,加权增加的交叉渗透性损失是为了减少代孕模型与受害者模型之间的梯度角,以加强对抗性例子的可转移性。理论分析和广泛的实验表明,我们的方法可以大大改进对CIFAR10100和图像网络的梯度对立式攻击的可转移性,并用同样的几个查询比黑盒调查攻击更明显。