State-of-the-art machine learning models are vulnerable to data poisoning attacks whose purpose is to undermine the integrity of the model. However, the current literature on data poisoning attacks is mainly focused on ad hoc techniques that are only applicable to specific machine learning models. Additionally, the existing data poisoning attacks in the literature are limited to either binary classifiers or to gradient-based algorithms. To address these limitations, this paper first proposes a novel model-free label-flipping attack based on the multi-modality of the data, in which the adversary targets the clusters of classes while constrained by a label-flipping budget. The complexity of our proposed attack algorithm is linear in time over the size of the dataset. Also, the proposed attack can increase the error up to two times for the same attack budget. Second, a novel defense technique based on the Synthetic Reduced Nearest Neighbor (SRNN) model is proposed. The defense technique can detect and exclude flipped samples on the fly during the training procedure. Through extensive experimental analysis, we demonstrate that (i) the proposed attack technique can deteriorate the accuracy of several models drastically, and (ii) under the proposed attack, the proposed defense technique significantly outperforms other conventional machine learning models in recovering the accuracy of the targeted model.
翻译:然而,目前关于数据中毒袭击的文献主要侧重于只适用于特定机器学习模型的特设技术。此外,文献中现有的数据中毒袭击仅限于二进制分类器或梯度算法。为了解决这些局限性,本文件首先提议根据数据多式模式,采用新的无模型标签脱钩攻击,敌对方针对各类组群,但受标签擦拭预算限制。我们拟议的攻击算法的复杂性在时间上直线超过数据集的大小。此外,拟议的攻击可能使同一攻击预算中的错误增加高达两次。第二,根据合成减少近邻网(SRNN)模型提出了新的防御技术。在培训过程中,国防技术可以探测和排除飞翔上的翻转样品。通过广泛的实验分析,我们证明(一) 拟议的攻击技术可以使若干模型的准确性急剧下降,并在拟议目标攻击模型下大幅改进常规防御模型。在正在恢复的常规防御模型下,拟议采用新的防御技术。