Attack Ensemble (AE), which combines multiple attacks together, provides a reliable way to evaluate adversarial robustness. In practice, AEs are often constructed and tuned by human experts, which however tends to be sub-optimal and time-consuming. In this work, we present AutoAE, a conceptually simple approach for automatically constructing AEs. In brief, AutoAE repeatedly adds the attack and its iteration steps to the ensemble that maximizes ensemble improvement per additional iteration consumed. We show theoretically that AutoAE yields AEs provably within a constant factor of the optimal for a given defense. We then use AutoAE to construct two AEs for $l_{\infty}$ and $l_2$ attacks, and apply them without any tuning or adaptation to 45 top adversarial defenses on the RobustBench leaderboard. In all except one cases we achieve equal or better (often the latter) robustness evaluation than existing AEs, and notably, in 29 cases we achieve better robustness evaluation than the best known one. Such performance of AutoAE shows itself as a reliable evaluation protocol for adversarial robustness, which further indicates the huge potential of automatic AE construction. Code is available at \url{https://github.com/LeegerPENG/AutoAE}.
翻译:将多重攻击合并在一起的“ 攻击集合” 提供了可靠的方法来评估对抗性强健性。 实际上, AE 通常由人类专家建造和调制,但人类专家往往不够优化和耗时。 在这项工作中,我们提出AutoAE,这是自动建造 AE 的简单概念方法。简而言之, AutoAE 反复将攻击及其迭代步骤添加到组合体中,每多经历一次重复,就能最大限度地提高组合性强力。我们理论上表明,AutoAE 产生AE 的常数在给定的防御最优的常数中是可行的。我们随后使用AutoAE 来为$lüinfty} 和$l_2$ 攻击建造两个AE,并在不作任何调整或调整的情况下将其应用到RobustitBechnch 头板上的45个顶级对抗性防御。 除了一个我们比现有的AEEEOV/ 强力评估(通常是后者),特别是29个我们比已知的最佳防御性评估更强力。