Deep learning systems are known to be vulnerable to adversarial examples. In particular, query-based black-box attacks do not require knowledge of the deep learning model, but can compute adversarial examples over the network by submitting queries and inspecting the returned results. Recent work has greatly improved the efficiency of these attacks, demonstrating their practicality on today's ML-as-a-service platforms. We propose Blacklight, a new defense against query-based black-box adversarial attacks. The fundamental insight driving our design is that, to compute adversarial examples, these attacks perform iterative optimization over the network, producing image queries that are highly similar in the input space. Blacklight detects query-based black-box attacks by detecting these highly similar queries, using an efficient similarity engine operating on probabilistic content fingerprints. We evaluate Blacklight against eight state-of-the-art attacks, across a variety of models and image classification tasks. Blacklight identifies them all, often after only a handful of queries. By rejecting all detected queries, Blacklight prevents any attack from completing, even when attackers persist in submitting queries after account bans or query rejections. Blacklight is also robust against several powerful countermeasures, including an optimal black-box attack that approximates white-box attacks in efficiency. Finally, we illustrate how Blacklight generalizes to other domains like text classification.
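The core idea of detecting highly similar queries via probabilistic content fingerprints can be sketched as follows. This is a minimal illustrative sketch, not the paper's actual algorithm: the function names, and the quantization, windowing, and threshold parameters, are hypothetical placeholders.

```python
import hashlib

def fingerprint(pixels, window=8, step=4, q=16, top_k=20):
    """Illustrative probabilistic content fingerprint: quantize pixel
    values (so tiny perturbations collapse to the same value), hash
    overlapping windows, and keep only the top_k smallest hashes as a
    compact probabilistic summary. All parameters are hypothetical."""
    quantized = [p // q for p in pixels]
    hashes = []
    for i in range(0, len(quantized) - window + 1, step):
        chunk = bytes(quantized[i:i + window])
        hashes.append(hashlib.sha256(chunk).hexdigest())
    return set(sorted(hashes)[:top_k])

def is_attack_query(fp, history, threshold=0.5):
    """Flag a query whose fingerprint overlaps any previously seen
    fingerprint beyond the threshold fraction, suggesting it belongs
    to an iterative black-box attack sequence."""
    for prior in history:
        if len(fp & prior) / max(len(fp), 1) >= threshold:
            return True
    return False
```

Because iterative black-box attacks submit queries that differ only by small perturbations, their quantized windows, and hence their fingerprints, overlap heavily, while independent benign images share almost no hashes.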