Deep neural networks are vulnerable to adversarial attacks. Recent studies on adversarial robustness focus on the loss landscape in parameter space, since it relates to optimization and generalization performance. These studies conclude that the difficulty of adversarial training stems from the non-smoothness of the loss function, i.e., its gradient is not Lipschitz continuous. However, this analysis ignores the dependence of adversarial attacks on the model parameters. Since adversarial attacks are optimized against the model, they should depend on its parameters. Taking this dependence into account, we analyze in more detail the smoothness of the adversarial training loss using attacks that are optimal for the current model parameters. We reveal that the constraint on adversarial attacks is one cause of the non-smoothness and that the smoothness depends on the type of constraint: specifically, the $L_\infty$ constraint can cause more non-smoothness than the $L_2$ constraint. Moreover, our analysis implies that flattening the loss function with respect to the input data tends to increase the Lipschitz constant of the gradient of the adversarial loss. To address the non-smoothness, we show that EntropySGD smooths the non-smooth loss and improves the performance of adversarial training.
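The claim that the $L_\infty$ constraint causes more non-smoothness than the $L_2$ constraint can be illustrated on a toy model. The sketch below is not from the paper; it assumes a linear model $f(x) = w \cdot x$ with loss $\ell(z) = -z$, for which the worst-case perturbation in an $L_\infty$ ball of radius $\varepsilon$ is $\delta = -\varepsilon\,\mathrm{sign}(w)$, yielding the adversarial loss $-(w \cdot x - \varepsilon \lVert w \rVert_1)$. Its gradient contains the term $\varepsilon\,\mathrm{sign}(w)$, which jumps whenever a coordinate of $w$ crosses zero, so the gradient is not Lipschitz continuous. Under an $L_2$ ball the corresponding term is $\varepsilon\, w / \lVert w \rVert_2$, which is continuous away from the origin.

```python
import numpy as np

def adv_grad_linf(w, x, eps):
    """Gradient (in w) of the L_inf adversarial loss -(w.x - eps*||w||_1)."""
    return -x + eps * np.sign(w)

def adv_grad_l2(w, x, eps):
    """Gradient (in w) of the L_2 adversarial loss -(w.x - eps*||w||_2)."""
    return -x + eps * w / np.linalg.norm(w)

x = np.array([1.0, 2.0])
eps = 0.5
# Two parameter vectors on either side of w_1 = 0.
w_minus = np.array([-1e-6, 1.0])
w_plus = np.array([1e-6, 1.0])

# The L_inf gradient jumps by 2*eps when w_1 crosses zero ...
jump_linf = np.abs(adv_grad_linf(w_plus, x, eps)
                   - adv_grad_linf(w_minus, x, eps)).max()
# ... while the L_2 gradient barely changes.
jump_l2 = np.abs(adv_grad_l2(w_plus, x, eps)
                 - adv_grad_l2(w_minus, x, eps)).max()
print(jump_linf, jump_l2)
```

The discontinuity of size $2\varepsilon$ in the $L_\infty$ case, against a vanishing change in the $L_2$ case, mirrors the abstract's claim in the simplest possible setting.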