Adversarial training (AT) in order to achieve adversarial robustness wrt single $l_p$-threat models has been discussed extensively. However, for safety-critical systems adversarial robustness should be achieved wrt all $l_p$-threat models simultaneously. In this paper we develop a simple and efficient training scheme to achieve adversarial robustness against the union of $l_p$-threat models. Our novel $l_1+l_\infty$-AT scheme is based on geometric considerations of the different $l_p$-balls and costs as much as normal adversarial training against a single $l_p$-threat model. Moreover, we show that using our $l_1+l_\infty$-AT scheme one can fine-tune with just 3 epochs any $l_p$-robust model (for $p \in \{1,2,\infty\}$) and achieve multiple norm adversarial robustness. In this way we boost the previous state-of-the-art reported for multiple-norm robustness by more than $6\%$ on CIFAR-10 and report up to our knowledge the first ImageNet models with multiple norm robustness. Moreover, we study the general transfer of adversarial robustness between different threat models and in this way boost the previous SOTA $l_1$-robustness on CIFAR-10 by almost $10\%$.
翻译:为实现对抗性强力单一美元威胁模式,已经广泛讨论了Aversarial培训(AT),以实现对抗性强力单一美元威胁模式。然而,对于安全临界系统,对抗性强力应同时实现所有美元威胁模式。在本文件中,我们制定了一个简单有效的培训计划,以实现对抗美元威胁模式联盟的对抗性强力。我们的新颖的$1+1+l ⁇ infty-AT计划是基于对不同美元Balls和成本的几何考虑,同样也是针对单一美元威胁模式的正常对抗性培训。此外,我们表明,使用我们的$_1+l ⁇ infty-AT计划可以同时实现所有美元威胁模式。我们开发一个简单有效的培训计划,以实现对抗美元威胁模式联盟的对抗性强力对抗性强力。 我们通过这种方式,我们用超过6美元美元威胁模式报告了我们先前的多诺美的多面强力风险模式。