Despite its popularity, deep neural networks are easily fooled. To alleviate this deficiency, researchers are actively developing new training strategies, which encourage models that are robust to small input perturbations. Several successful robust training methods have been proposed. However, many of them rely on strong adversaries, which can be prohibitively expensive to generate when the input dimension is high and the model structure is complicated. We adopt a new perspective on robustness and propose a novel training algorithm that allows a more effective use of adversaries. Our method improves the model robustness at each local ball centered around an adversary and then, by combining these local balls through a global term, achieves overall robustness. We demonstrate that, by maximizing the use of adversaries via focusing on local balls, we achieve high robust accuracy with weak adversaries. Specifically, our method reaches a similar robust accuracy level to the state of the art approaches trained on strong adversaries on MNIST, CIFAR-10 and CIFAR-100. As a result, the overall training time is reduced. Furthermore, when trained with strong adversaries, our method matches with the current state of the art on MNIST and outperforms them on CIFAR-10 and CIFAR-100.
翻译:尽管广受欢迎,但深心神经网络很容易被愚弄。为了减轻这种缺陷,研究人员正在积极制定新的培训战略,这些战略鼓励了对小投入有影响的模式。已经提出了几项成功的强力培训方法。然而,其中许多人依赖强势对手,而当投入层面高而模型结构复杂时,这种对手可能极其昂贵。我们从新的角度看待强力,并提出了一个新的培训算法,以便更有效地利用对手。我们的方法改进了以对手为中心的每个地方球的强力模型,然后,通过全球术语将这些地方球组合起来,从而实现总体强力。我们证明,通过集中关注当地球,我们最大限度地利用对手,我们就能在弱势对手中达到高度稳健的精准性。具体地说,我们的方法达到了类似强的精准度水平,与在多国空间信息和信息系统、CIFAR-10和CIFAR-100上强敌所训练的艺术方法相匹配。结果,总体培训时间缩短。此外,在与强势对手培训时,我们的方法与目前关于多国信息技术的状况相匹配,并在CIFAR-10和CIFAR-100上超越对手。