It is well-known that standard neural networks, even with a high classification accuracy, are vulnerable to small $\ell_\infty$-norm bounded adversarial perturbations. Although many attempts have been made, most previous works either can only provide empirical verification of the defense to a particular attack method, or can only develop a certified guarantee of the model robustness in limited scenarios. In this paper, we seek for a new approach to develop a theoretically principled neural network that inherently resists $\ell_\infty$ perturbations. In particular, we design a novel neuron that uses $\ell_\infty$-distance as its basic operation (which we call $\ell_\infty$-dist neuron), and show that any neural network constructed with $\ell_\infty$-dist neurons (called $\ell_{\infty}$-dist net) is naturally a 1-Lipschitz function with respect to $\ell_\infty$-norm. This directly provides a rigorous guarantee of the certified robustness based on the margin of prediction outputs. We then prove that such networks have enough expressive power to approximate any 1-Lipschitz function with robust generalization guarantee. We further provide a holistic training strategy that can greatly alleviate optimization difficulties. Experimental results show that using $\ell_{\infty}$-dist nets as basic building blocks, we consistently achieve state-of-the-art performance on commonly used datasets: 93.09% certified accuracy on MNIST ($\epsilon=0.3$), 35.42% on CIFAR-10 ($\epsilon=8/255$) and 16.31% on TinyImageNet ($\epsilon=1/255$).
翻译:众所周知,标准的神经网络,即使分类精确度很高,也容易受到小的美元(ell@inffty) 美元(norm) 约束的内脏网络的干扰。虽然已经做了许多尝试,但大多数先前的工程都只能对特定攻击方法的防御进行实证核查,或者只能在有限的情景中为模型的稳健性提供经认证的保证。在本文中,我们寻求一种新的方法来开发一个理论上有原则的神经网络,这种网络本可以抵制$(ell_infty) 美元(nurb) 。特别是,我们设计了一个新颖的神经网络,用$(ell_infty) 美元(norm) 作为基本操作(我们称之为$(infty) 美元(dist) 神经元(或dist netnetnown) 进行实证性能验证的坚固度($(nell_infinty) 美元(nutrial) 。我们随后可以证明,任何这样的网络在预测输出的基底值(ral-alalal) pralalalalal destrital reslistal) exlistal a real relistal relistal sal sal sal sal sal sal sal sal sal sal suplutislutislutislupluplupluplupal) 这样的功能可以进一步展示出一个稳定的网络能。