Adversarial training suffers from the issue of robust overfitting, which seriously impairs its generalization performance. Data augmentation, which is effective at preventing overfitting in standard training, has been observed by many previous works to be ineffective in mitigating overfitting in adversarial training. This work proves that, contrary to previous findings, data augmentation alone can significantly boost accuracy and robustness in adversarial training. We find that the hardness and the diversity of data augmentation are important factors in combating robust overfitting. In general, diversity can improve both accuracy and robustness, while hardness can boost robustness at the cost of accuracy within a certain limit and degrade them both over that limit. To mitigate robust overfitting, we first propose a new crop transformation, Cropshift, which has improved diversity compared to the conventional one (Padcrop). We then propose a new data augmentation scheme, based on Cropshift, with much improved diversity and well-balanced hardness. Empirically, our augmentation method achieves the state-of-the-art accuracy and robustness for data augmentations in adversarial training. Furthermore, when combined with weight averaging it matches, or even exceeds, the performance of the best contemporary regularization methods for alleviating robust overfitting. Code is available at: https://github.com/TreeLLi/DA-Alone-Improves-AT.
翻译:Adversarial培训受到强力超编问题的影响,这严重地损害了其总体性能; 数据扩编有效防止标准培训超编,以往许多工作都认为,数据扩编在减少对抗性培训超编方面没有实效; 这项工作证明,与以前的调查结果相反,仅数据扩编就能大大提高对抗性培训的准确性和稳健性; 我们发现,数据扩编的难度和多样性是打击强力超编的重要因素; 一般而言,多样性可以提高准确性和稳健性,而硬性则能够以一定限度内的准确性为代价提高稳健性,并降低两者的极限; 为了减轻强健性超标,我们首先提议一种新的作物变换作物,即作物变换,与常规的(Padcrop)相比,提高了多样性; 然后,我们提出一个新的数据扩编计划,以作物变换制为基础,大大提高多样性和平衡性; 我们的扩编方法在对抗性培训中达到最新准确性和稳健性的数据扩编培训。 此外,如果与加权结合,或甚至超过此限限限值,我们首先提出新的作物变换作物变换作物变型方法。