Universal Adversarial Perturbations (UAPs) are a prominent class of adversarial examples that exploit the systemic vulnerabilities and enable physically realizable and robust attacks against Deep Neural Networks (DNNs). UAPs generalize across many different inputs; this leads to realistic and effective attacks that can be applied at scale. In this paper we propose HyperNeuron, an efficient and scalable algorithm that allows for the real-time detection of UAPs by identifying suspicious neuron hyper-activations. Our results show the effectiveness of HyperNeuron on multiple tasks (image classification, object detection), against a wide variety of universal attacks, and in realistic scenarios, like perceptual ad-blocking and adversarial patches. HyperNeuron is able to simultaneously detect both adversarial mask and patch UAPs with comparable or better performance than existing UAP defenses whilst introducing a significantly reduced latency of only 0.86 milliseconds per image. This suggests that many realistic and practical universal attacks can be reliably mitigated in real-time, which shows promise for the robust deployment of machine learning systems.
翻译:通用反扰动(UAPs)是一系列突出的对抗性例子,它们利用了系统脆弱性,能够对深神经网络(DNNS)进行实际可实现的强力攻击。 UAPs对许多不同的投入进行概括化,从而导致可以大规模应用的现实而有效的攻击。在本文中,我们建议采用超Neuron, 一种高效且可扩缩的算法,通过识别可疑的神经神经超活性来实时检测UAPs。我们的结果表明,超Neuron对多种任务(图像分类、物体探测)、对各种普遍攻击和现实情景(如感知性阻塞和对抗性对立补补)的效用是有效的。超Neuron能够同时检测到与现有UAP防御系统类似或更好的对抗面罩和对准UAPs的功能,同时引入了仅0.86毫秒的大幅降低的视线。这表明,许多现实和实用的普遍攻击在实时可以可靠地减轻,这显示了机器学习系统的强大部署的希望。