Making deep neural networks robust to small adversarial perturbations has recently been sought in many applications. Adversarial training through iterative projected gradient descent (PGD) has been established as one of the mainstream ideas to achieve this goal. However, PGD is computationally demanding and often prohibitive for large datasets and models. For this reason, single-step PGD, also known as FGSM, has recently gained interest in the field. Unfortunately, FGSM training leads to a phenomenon called ``catastrophic overfitting,'' which is a sudden drop in the adversarial accuracy under the PGD attack. In this paper, we support the idea that small input gradients play a key role in this phenomenon, and hence propose to zero out the small-magnitude elements of the input gradient when crafting FGSM attacks. Our proposed idea, while being simple and efficient, achieves competitive adversarial accuracy on various datasets.
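As a minimal sketch of the idea described above (not the authors' reference implementation), the snippet below crafts an FGSM perturbation after zeroing the small-magnitude elements of the input gradient. The function name `fgsm_with_sparse_gradient`, the parameter `keep_frac`, and the per-example quantile thresholding rule are illustrative assumptions rather than details taken from the paper.

```python
import torch
import torch.nn.functional as F

def fgsm_with_sparse_gradient(model, x, y, eps, keep_frac=0.5):
    """Craft an FGSM perturbation using only large input-gradient elements (sketch)."""
    x = x.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(x), y)
    grad, = torch.autograd.grad(loss, x)

    # Zero gradient elements whose magnitude falls below a per-example
    # quantile threshold (hypothetical choice of thresholding rule).
    flat = grad.abs().flatten(1)
    thresh = torch.quantile(flat, 1.0 - keep_frac, dim=1)
    mask = grad.abs() >= thresh.view(-1, *([1] * (grad.dim() - 1)))

    # Standard FGSM step on the masked gradient, clipped to the valid input range.
    x_adv = x + eps * torch.sign(grad * mask)
    return torch.clamp(x_adv, 0.0, 1.0).detach()
```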