ASSET: 跨越多种深学习范式的强有力后门数据探测 (ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms)

Backdoor data detection is traditionally studied in an end-to-end supervised learning (SL) setting. However, recent years have seen the proliferating adoption of self-supervised learning (SSL) and transfer learning (TL), due to their lesser need for labeled data. Successful backdoor attacks have also been demonstrated in these new settings. However, we lack a thorough understanding of the applicability of existing detection methods across a variety of learning settings. By evaluating 56 attack settings, we show that the performance of most existing detection methods varies significantly across different attacks and poison ratios, and all fail on the state-of-the-art clean-label attack. In addition, they either become inapplicable or suffer large performance losses when applied to SSL and TL. We propose a new detection method called Active Separation via Offset (ASSET), which actively induces different model behaviors between the backdoor and clean samples to promote their separation. We also provide procedures to adaptively select the number of suspicious points to remove. In the end-to-end SL setting, ASSET is superior to existing methods in terms of consistency of defensive performance across different attacks and robustness to changes in poison ratios; in particular, it is the only method that can detect the state-of-the-art clean-label attack. Moreover, ASSET's average detection rates are higher than the best existing methods in SSL and TL, respectively, by 69.3% and 33.2%, thus providing the first practical backdoor defense for these new DL settings. We open-source the project to drive further development and encourage engagement: https://github.com/ruoxi-jia-group/ASSET.

翻译：传统上,在终端到终端监管的学习(SL)环境中研究后门数据检测。然而,近年来,由于对标签数据的需求较少,自我监督学习(SSL)和转移学习(TL)的采用率呈上升趋势,原因是对标签数据的需求较少。在这些新环境下也展示了成功的后门袭击。然而,我们对于现有检测方法在各种学习环境中的适用性缺乏透彻的理解。通过对56个袭击环境进行评估,我们发现,大多数现有检测方法的性能在不同袭击和毒药比率之间有很大差异,而且所有最先进的清洁标签袭击都失败了。此外,它们要么变得不适用,要么在应用SL和TL时遭受了巨大的性能损失。我们提出了一种名为“通过Offset(ASSET)主动分离”的新型检测方法,积极诱导出后门和干净样本之间不同的模式行为,以促进其分离。我们还提供了适应性地选择要删除的可疑点的程序。在SLFOF2的后端设置中,ASET比现有的最佳防御性评估方法要优于现有方法。提供不同袭击中的最佳防御性操作,因此,SASL3的SARSL的检测率是SASAR标准,因此,在SASAR标准中可以分别测测测测测测出现有标准。我们测算出现有标准。

相关内容

ASSETS

关注 0

ACM SIGACCESS Conference on Computers and Accessibility是为残疾人和老年人提供与计算机相关的设计、评估、使用和教育研究的首要论坛。我们欢迎提交原始的高质量的有关计算和可访问性的主题。今年，ASSETS首次将其范围扩大到包括关于计算机无障碍教育相关主题的原创高质量研究。官网链接：http://assets19.sigaccess.org/

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日