Modern machine learning increasingly requires training on a large collection of data from multiple sources, not all of which can be trusted. A particularly concerning scenario is when a small fraction of poisoned data changes the behavior of the trained model when triggered by an attacker-specified watermark. Such a compromised model can be deployed unnoticed because it remains accurate otherwise. There have been promising attempts to use the intermediate representations of such a model to separate corrupted examples from clean ones. However, these defenses work only when a certain spectral signature of the poisoned examples is large enough for detection; a wide range of attacks falls outside what existing defenses can protect against. We propose a novel defense algorithm that uses robust covariance estimation to amplify the spectral signature of corrupted data. This defense yields a clean model, completely removing the backdoor, even in regimes where previous methods have no hope of detecting the poisoned examples. Code and pre-trained models are available at https://github.com/SewoongLab/spectre-defense.
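To make the idea concrete, below is a minimal sketch (not the paper's exact SPECTRE algorithm) of scoring examples by a spectral signature computed after whitening with a robustly estimated covariance. The iterative trimming loop is a hypothetical stand-in for a proper robust covariance estimator, and `whitened_spectral_scores`, `trim_frac`, and `n_iter` are illustrative names and parameters, not part of the released code.

```python
import numpy as np

def whitened_spectral_scores(reps, trim_frac=0.1, n_iter=5):
    """Score examples (rows of `reps`, e.g. penultimate-layer features of one
    label class) by their spectral signature after whitening.  Sketch only:
    the trimming loop approximates robust covariance estimation."""
    X = np.asarray(reps, dtype=float)
    keep = np.arange(len(X))
    for _ in range(n_iter):
        mu = X[keep].mean(axis=0)
        cov = np.cov(X[keep].T) + 1e-6 * np.eye(X.shape[1])
        # Whiten with the current covariance estimate: cov^{-1} = L L^T.
        L = np.linalg.cholesky(np.linalg.inv(cov))
        W = (X - mu) @ L
        # Spectral signature: squared projection onto the top singular vector
        # of the (currently kept) whitened representations.
        _, _, vt = np.linalg.svd(W[keep], full_matrices=False)
        scores = (W @ vt[0]) ** 2
        # Trim the highest-scoring points before re-estimating the covariance.
        cutoff = np.quantile(scores[keep], 1 - trim_frac)
        keep = np.where(scores <= cutoff)[0]
    return scores  # higher score = more suspicious

# Usage: drop the examples with the largest scores, then retrain the model.
```

Whitening is the key step this sketch tries to convey: dividing out the clean data's covariance suppresses benign directions of variation, so the residual direction introduced by the poisoned examples stands out even when its raw spectral signature is small.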