Deep neural networks have been shown to suffer from critical vulnerabilities under adversarial attacks. This phenomenon stimulated the creation of different attack and defense strategies similar to those adopted in cyberspace security. The dependence of such strategies on attack and defense mechanisms makes the associated algorithms on both sides appear as closely reciprocating processes. The defense strategies are particularly passive in these processes, and enhancing initiative of such strategies can be an effective way to get out of this arms race. Inspired by the dynamic defense approach in cyberspace, this paper builds upon stochastic ensemble smoothing based on defense method of random smoothing and model ensemble. Proposed method employs network architecture and smoothing parameters as ensemble attributes, and dynamically change attribute-based ensemble model before every inference prediction request. The proposed method handles the extreme transferability and vulnerability of ensemble models under white-box attacks. Experimental comparison of ASR-vs-distortion curves with different attack scenarios shows that even the attacker with the highest attack capability cannot easily exceed the attack success rate associated with the ensemble smoothed model, especially under untargeted attacks.
翻译:深心神经网络在对抗性攻击下受到严重脆弱性的影响,这一现象刺激了与网络空间安全类似的不同攻击和防御战略的形成。这种战略对攻击和防御机制的依赖使得双方的相关算法看起来都是紧密的对应过程。防御战略在这些过程中特别被动,加强这种战略的主动性可以成为摆脱军备竞赛的有效途径。在网络空间动态防御方法的启发下,本文件以随机平滑和模型共通的防御方法为基础的零碎合体为基础,以随机平滑的防御方法为基础。拟议方法使用网络架构和平滑参数作为共性属性,并在每次推断预测要求之前动态改变基于属性的共通性模型。拟议方法处理白箱攻击下的共性模型的极端可转移性和脆弱性。ASR-vs-扭曲曲线与不同攻击情景的实验性比较表明,即使是攻击能力最高的攻击者也无法轻易超过与可混合平稳模型相关的攻击成功率,特别是在非目标攻击下。