Adversarial training (AT) methods have been found to be effective against adversarial attacks on deep neural networks. Many variants of AT have been proposed to improve its performance. Pang et al. [1] have recently shown that incorporating hypersphere embedding (HE) into the existing AT procedures enhances robustness. We observe that the existing AT procedures are not designed for the HE framework, and thus fail to adequately learn the angular discriminative information available in the HE framework. In this paper, we propose integrating HE into AT with regularization terms that exploit the rich angular information available in the HE framework. Specifically, our method, termed angular-AT, adds regularization terms to AT that explicitly enforce weight-feature compactness and inter-class separation; all expressed in terms of angular features. Experimental results show that angular-AT further improves adversarial robustness.
翻译:反向培训(AT)方法被认为对对付对深神经网络的对抗性攻击是有效的,已经提出了许多反向培训的变种,以提高其性能。Pang等人(Pang等人)最近[1]表明,将超视距嵌入(HE)纳入现有的AT程序会增强强健性。我们认为,现有的AT程序不是为HE框架设计的,因此无法充分了解HE框架内可获得的角化歧视信息。在本文中,我们提议将HE纳入AT的正规化条款,利用HE框架内现有的丰富的三角信息。具体地说,我们的方法,称为角-AT,增加了对AT的正规化条款,以明确强制实施重量-体力紧和阶级间分离;所有这些都以角性特征表示。实验结果显示,角-AT将进一步提高对抗性强健性。</s>