In this paper, we study the adversarial examples existence and adversarial training from the standpoint of convergence and provide evidence that pointwise convergence in ANNs can explain these observations. The main contribution of our proposal is that it relates the objective of the evasion attacks and adversarial training with concepts already defined in learning theory. Also, we extend and unify some of the other proposals in the literature and provide alternative explanations on the observations made in those proposals. Through different experiments, we demonstrate that the framework is valuable in the study of the phenomenon and is applicable to real-world problems.
翻译:在本文中,我们从趋同的角度研究对抗性实例的存在和对抗性培训,并提供证据,说明非本国人员中最明智的趋同可以解释这些看法,我们提案的主要贡献是,它把规避攻击和对抗性培训的目标与学习理论中已经界定的概念联系起来,我们还扩展和统一文献中的一些其他建议,并对这些建议中提出的意见提供其他解释,通过不同的实验,我们证明该框架在研究这一现象方面很有价值,并适用于现实世界的问题。