In adversarial machine learning, new defenses against attacks on deep learning systems are routinely broken soon after their release by more powerful attacks. In this context, forensic tools can offer a valuable complement to existing defenses, by tracing back a successful attack to its root cause, and offering a path forward for mitigation to prevent similar attacks in the future. In this paper, we describe our efforts in developing a forensic traceback tool for poison attacks on deep neural networks. We propose a novel iterative clustering and pruning solution that trims "innocent" training samples, until all that remains is the set of poisoned data responsible for the attack. Our method clusters training samples based on their impact on model parameters, then uses an efficient data unlearning method to prune innocent clusters. We empirically demonstrate the efficacy of our system on three types of dirty-label (backdoor) poison attacks and three types of clean-label poison attacks, across domains of computer vision and malware classification. Our system achieves over 98.4% precision and 96.8% recall across all attacks. We also show that our system is robust against four anti-forensics measures specifically designed to attack it.
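To make the iterative clustering-and-pruning idea concrete, the following is a minimal, self-contained toy sketch in Python. Everything in it is an illustrative assumption rather than the paper's actual implementation: each training sample is represented by a synthetic gradient-like "parameter impact" vector, KMeans stands in for the clustering step, and attack_succeeds stands in for unlearning a cluster and re-testing whether the attack still works.

```python
import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)

# Toy setup: represent each training sample by its "impact on model
# parameters" (here, a synthetic gradient-like vector). Poison samples
# share a common direction; benign samples are isotropic noise.
n_benign, n_poison, dim = 480, 20, 32
poison_dir = rng.normal(size=dim)
impacts = np.vstack([
    rng.normal(size=(n_benign, dim)),                     # benign
    poison_dir + 0.3 * rng.normal(size=(n_poison, dim)),  # poison
])
true_poison = set(range(n_benign, n_benign + n_poison))

def attack_succeeds(idx):
    # Stand-in for "unlearn the removed cluster, then re-run the attack":
    # in this toy, the attack still works while the retained set carries
    # enough of the shared poison direction.
    signal = impacts[idx].sum(axis=0) @ poison_dir
    return signal > 0.6 * n_poison * (poison_dir @ poison_dir)

suspect = np.arange(len(impacts))
while len(suspect) > 2:
    # Cluster the remaining suspects by their parameter-impact features.
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(
        impacts[suspect])
    pruned = False
    for k in (0, 1):
        keep = suspect[labels != k]   # tentatively unlearn cluster k
        if attack_succeeds(keep):     # cluster k was innocent: prune it
            suspect, pruned = keep, True
            break
    if not pruned:                    # no innocent cluster left: stop
        break

flagged = set(suspect.tolist())
print(f"flagged {len(flagged)} samples; "
      f"{len(flagged & true_poison)}/{n_poison} true poisons recovered")
```

The loop mirrors the trimming logic described above: a cluster is pruned only if the attack still succeeds without it, so "innocent" training data is discarded round by round and the surviving set converges toward the poisoned samples.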