This study provides a new understanding of the adversarial attack problem by examining the correlation between adversarial attack and visual attention change. In particular, we observed that: (1) images with incomplete attention regions are more vulnerable to adversarial attacks; and (2) successful adversarial attacks lead to deviated and scattered attention map. Accordingly, an attention-based adversarial defense framework is designed to simultaneously rectify the attention map for prediction and preserve the attention area between adversarial and original images. The problem of adding iteratively attacked samples is also discussed in the context of visual attention change. We hope the attention-related data analysis and defense solution in this study will shed some light on the mechanism behind the adversarial attack and also facilitate future adversarial defense/attack model design.
翻译:这项研究通过审查对抗性攻击与视觉关注变化之间的关系,为对抗性攻击问题提供了一种新的理解,特别是,我们注意到:(1) 关注不完整的地区更容易受到对抗性攻击;(2) 成功的对抗性攻击导致偏差和分散的注意地图,因此,一个以关注为基础的对抗性防御框架旨在同时纠正预测关注地图,并保持敌对性和原始图像之间的关注领域;在视觉关注变化的背景下,也讨论了添加迭代攻击样品的问题;我们希望本研究报告中与关注有关的数据分析和防御解决办法将使人们了解敌对性攻击背后的机制,并便利未来的对抗性防御/攻击模式设计。