Although many methods have been proposed to enhance the transferability of adversarial perturbations, these methods are designed in a heuristic manner, and the essential mechanism for improving adversarial transferability is still unclear. This paper summarizes the common mechanism shared by twelve previous transferability-boosting methods in a unified view, i.e., these methods all reduce game-theoretic interactions between regional adversarial perturbations. To this end, we focus on the attacking utility of all interactions between regional adversarial perturbations, and we first discover and prove the negative correlation between the adversarial transferability and the attacking utility of interactions. Based on this discovery, we theoretically prove and empirically verify that twelve previous transferability-boosting methods all reduce interactions between regional adversarial perturbations. More crucially, we consider the reduction of interactions as the essential reason for the enhancement of adversarial transferability. Furthermore, we design the interaction loss to directly penalize interactions between regional adversarial perturbations during attacking. Experimental results show that the interaction loss significantly improves the transferability of adversarial perturbations.
翻译:虽然已提出许多方法来提高对抗性扰动的可转移性,但这些方法的设计过于繁琐,改进对抗性扰动性的基本机制仍然不明确。本文件总结了12种先前的转移性加速方法所共有的共同机制,以统一的观点来看,即这些方法都减少了区域对立性扰动之间的游戏理论互动。为此,我们侧重于区域对立性扰动之间所有相互作用的效用的攻击性,我们首先发现并证明对抗性转移和攻击性互动的效用之间的消极关联性。根据这一发现,我们理论上证明并用经验核查了12种先前的转移性加速方法都减少了区域对立性扰动性之间的相互作用。更重要的是,我们认为减少相互作用是加强对抗性扰动性扰动性互动性的基本理由。此外,我们设计互动损失来直接惩罚区域对立性扰动性扰动性扰动性之间的相互作用。实验结果显示,这种互动性损失大大改善了对抗性扰动性的可转移性。