Deep neural networks (DNNs) are well-known to be vulnerable to adversarial attacks, where malicious human-imperceptible perturbations are included in the input to the deep network to fool it into making a wrong classification. Recent studies have demonstrated that neural Ordinary Differential Equations (ODEs) are intrinsically more robust against adversarial attacks compared to vanilla DNNs. In this work, we propose a stable neural ODE with Lyapunov-stable equilibrium points for defending against adversarial attacks (SODEF). By ensuring that the equilibrium points of the ODE solution used as part of SODEF is Lyapunov-stable, the ODE solution for an input with a small perturbation converges to the same solution as the unperturbed input. We provide theoretical results that give insights into the stability of SODEF as well as the choice of regularizers to ensure its stability. Our analysis suggests that our proposed regularizers force the extracted feature points to be within a neighborhood of the Lyapunov-stable equilibrium points of the ODE. SODEF is compatible with many defense methods and can be applied to any neural network's final regressor layer to enhance its stability against adversarial attacks.
翻译:众所周知,深心神经网络(DNNS)容易受到对抗性攻击的伤害,在深心网络的输入中包括了恶意的人类无法察觉的干扰,以欺骗它进行错误的分类。最近的研究显示,神经普通差异(ODEs)与香草DNNNs相比,在本质上对对抗性攻击更为强大。在这项工作中,我们建议使用稳定的神经ODE,使用利帕普诺夫稳定的平衡点来防御对抗对抗性攻击(SODEF)。通过确保作为SODEF一部分使用的ODE溶液的平衡点是Lyapunov-ssable的平衡点,一个小扰动性输入的ODES解决方案与未受干扰的输入的解决方案汇合在一起。我们提供了理论结果,对SODEF的稳定性以及调控者确保其稳定性的选择有了深刻的认识。我们的分析表明,我们提议的调控者将提取的特征点强制在ODESODSOF的相邻区内。SODEF与许多防御性方法兼容,可以用来对抗任何稳定的神经系统。