Adversarial training has been demonstrated to be one of the most effective remedies for defending adversarial examples, yet it often suffers from the huge robustness generalization gap on unseen testing adversaries, deemed as the adversarially robust generalization problem. Despite the preliminary understandings devoted to adversarially robust generalization, little is known from the architectural perspective. To bridge the gap, this paper for the first time systematically investigated the relationship between adversarially robust generalization and architectural design. Inparticular, we comprehensively evaluated 20 most representative adversarially trained architectures on ImageNette and CIFAR-10 datasets towards multiple `p-norm adversarial attacks. Based on the extensive experiments, we found that, under aligned settings, Vision Transformers (e.g., PVT, CoAtNet) often yield better adversarially robust generalization while CNNs tend to overfit on specific attacks and fail to generalize on multiple adversaries. To better understand the nature behind it, we conduct theoretical analysis via the lens of Rademacher complexity. We revealed the fact that the higher weight sparsity contributes significantly towards the better adversarially robust generalization of Transformers, which can be often achieved by the specially-designed attention blocks. We hope our paper could help to better understand the mechanism for designing robust DNNs. Our model weights can be found at http://robust.art.
翻译:事实证明,Adversarial培训是捍卫对抗性实例的最有效补救办法之一,然而,它往往在被视为对抗性强的对抗性一般化问题上,在隐蔽测试对手方面存在着巨大的稳健性一般化差距,被认为是对抗性强的对抗性一般化问题。尽管初步谅解致力于对敌对性强强的概括化,但从建筑角度看却鲜为人知。为了弥合这一差距,本文件首次系统地调查了敌对性强的概括化和建筑设计之间的关系。总体而言,我们全面评价了图象网和CIFAR-10上20个最具代表性的对抗性辩论性培训结构,这些结构面向多重`p-norm对抗性攻击。根据广泛的实验,我们发现,在统一的环境下,愿景变换者(例如PVT、CoatNet)往往产生更好的对抗性强势一般化,而CNN往往过分适应特定攻击,无法对多重对手进行概括化。为了更好地了解它的性质,我们通过雷德马赫赫的复杂性透视镜进行理论分析。我们揭示了一个事实,高重的重度压力对改进了辩论性一般化的面面面面面面面面力,我们能够发现,我们更强烈的变压制的图的图能得到更好的理解。</s>