Semi-supervised learning methods can train high-accuracy machine learning models with a fraction of the labeled training samples required for traditional supervised learning. Such methods do not typically involve close review of the unlabeled training samples, making them tempting targets for data poisoning attacks. In this paper we investigate the vulnerabilities of semi-supervised learning methods to backdoor data poisoning attacks on the unlabeled samples. We show that simple poisoning attacks that influence the distribution of the poisoned samples' predicted labels are highly effective - achieving an average attack success rate as high as 96.9%. We introduce a generalized attack framework targeting semi-supervised learning methods to better understand and exploit their limitations and to motivate future defense strategies.
翻译:半监督的学习方法可以培训高精密机床学习模式,其中含有传统监督学习所需的有标签的培训样本的一小部分。这类方法通常不涉及仔细审查未贴标签的培训样本,使它们成为数据中毒袭击的诱饵。在本文件中,我们调查了半监督的学习方法对后门数据中毒袭击未贴标签样本的脆弱性。我们表明,影响有毒样品预测标签分布的简单中毒袭击非常有效 — — 达到96.9%的平均攻击成功率。我们引入了针对半监督的学习方法的普遍攻击框架,以更好地了解和利用其局限性并激励今后的防御战略。