We propose a versatile framework based on random search, Sparse-RS, for score-based sparse targeted and untargeted attacks in the black-box setting. Sparse-RS does not rely on substitute models and achieves state-of-the-art success rate and query efficiency for multiple sparse attack models: $l_0$-bounded perturbations, adversarial patches, and adversarial frames. The $l_0$-version of untargeted Sparse-RS outperforms all black-box and even all white-box attacks for different models on MNIST, CIFAR-10, and ImageNet. Moreover, our untargeted Sparse-RS achieves very high success rates even for the challenging settings of $20\times20$ adversarial patches and $2$-pixel wide adversarial frames for $224\times224$ images. Finally, we show that Sparse-RS can be applied to generate targeted universal adversarial patches where it significantly outperforms the existing approaches. The code of our framework is available at https://github.com/fra31/sparse-rs.
翻译:我们提出一个基于随机搜索的多功能框架,即Sprass-RS,用于黑箱设置中基于分数的稀少目标攻击和非目标攻击。Sprass-RS并不依赖替代模型,而是在多种稀有攻击模型中达到最先进的成功率和查询效率:$l_0美元绑定的扰动、对抗性补丁和对抗性框架。非目标的Sprass-RS的转换值超过所有黑箱,甚至白箱攻击,用于MNIST、CIFAR-10和图像网络的不同模型。此外,我们非目标的Sprass-RS取得了非常高的成功率,即使是具有挑战性的20美元对抗性补丁和224\times224美元图像的216美元宽比素框架。最后,我们表明,可以应用Sprass-RS生成有针对性的普遍对抗性补丁,在那里它大大超过现有办法。我们的框架代码可以在https://github.com/fra31/spassergy-rassy-rs上查阅。