Data poisoning is a threat model in which a malicious actor tampers with training data to manipulate outcomes at inference time. A variety of defenses against this threat model have been proposed, but each suffers from at least one of the following flaws: it is easily overcome by adaptive attacks, it severely reduces test performance, or it cannot generalize to diverse data poisoning threat models. Adversarial training and its variants are currently considered the only empirically strong defense against (inference-time) adversarial attacks. In this work, we extend the adversarial training framework to defend against (training-time) data poisoning, including targeted and backdoor attacks. Our method desensitizes networks to the effects of such attacks by crafting poisons during training and injecting them into training batches. We show that this defense withstands adaptive attacks, generalizes to diverse threat models, and incurs a better performance trade-off than previous defenses such as DP-SGD or (evasion) adversarial training.
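To make the training loop concrete, here is a minimal PyTorch sketch of the idea of crafting poisons on the fly and injecting them into each batch. It is not the paper's exact procedure: the `craft_poisons` step below is a simplified stand-in that takes a few PGD-style steps pushing a slice of the batch toward random incorrect labels (assuming a 10-class problem with inputs scaled to [0, 1]), and `poison_frac`, the step size, and the perturbation budget are illustrative choices.

```python
import torch
import torch.nn.functional as F

def craft_poisons(model, x, y, num_classes=10, eps=8 / 255, step=2 / 255, iters=5):
    """Perturb x inside an L-inf ball of radius eps so the model moves toward
    random incorrect labels: a simplified stand-in for a poison-crafting attack."""
    wrong = (y + torch.randint_like(y, 1, num_classes)) % num_classes  # labels != y
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(iters):
        loss = F.cross_entropy(model(x + delta), wrong)
        loss.backward()
        with torch.no_grad():
            delta -= step * delta.grad.sign()  # descend toward the wrong labels
            delta.clamp_(-eps, eps)            # stay inside the L-inf budget
        delta.grad.zero_()
    return (x + delta).clamp(0, 1).detach()    # assumes inputs scaled to [0, 1]

def train_step(model, opt, x, y, poison_frac=0.25):
    """One update: replace a fraction of the batch with freshly crafted
    poisons, keep their clean labels, and train on the mixed batch."""
    k = max(1, int(poison_frac * x.size(0)))
    x_poison = craft_poisons(model, x[:k], y[:k])
    x_mix = torch.cat([x_poison, x[k:]], dim=0)  # inject poisons into the batch
    opt.zero_grad()                              # drop gradients left by crafting
    loss = F.cross_entropy(model(x_mix), y)      # clean labels throughout
    loss.backward()
    opt.step()
    return loss.item()
```

Training on the mixed batch with the original clean labels is what desensitizes the network: the kind of perturbation a poisoner would exploit is encountered, and neutralized, during ordinary gradient updates.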