During the generation of invisible backdoor attack poisoned data, the feature space transformation operation tends to cause the loss of some poisoned features and weakens the mapping relationship between source images with triggers and target labels, resulting in the need for a higher poisoning rate to achieve the corresponding backdoor attack success rate. To solve the above problems, we propose the idea of feature repair for the first time and introduce the blind watermark technique to repair the poisoned features lost during the generation of poisoned data. Under the premise of ensuring consistent labeling, we propose a low-poisoning rate invisible backdoor attack based on feature repair, named FRIB. Benefiting from the above design concept, the new method enhances the mapping relationship between the source images with triggers and the target labels, and increases the degree of misleading DNNs, thus achieving a high backdoor attack success rate with a very low poisoning rate. Ultimately, the detailed experimental results show that the goal of achieving a high success rate of backdoor attacks with a very low poisoning rate is achieved on all MNIST, CIFAR10, GTSRB, and ImageNet datasets.
翻译:生成隐性后门攻击毒害数据时,特征空间转换操作往往造成某些有毒特征的丢失,削弱带有触发器和目标标签的源图像之间的绘图关系,导致需要提高中毒率,以实现相应的后门攻击成功率。为解决上述问题,我们首次提出特征修复构想,并采用盲水标记技术,修复生成有毒数据时丢失的中毒特征。在确保一致标签的前提下,我们提议基于特征修复的低中毒率隐性后门攻击,称为FRIB。从上述设计概念中受益的新方法加强了带有触发器和目标标签的源图像与目标标签之间的绘图关系,并增加了误导性 DNNN的力度,从而在非常低的中毒率下实现了较高的后门攻击成功率。最后,详细的实验结果显示,所有MNIST、CIFAR10、GTSRB和图像网络数据集都实现了以非常低的中毒率实现高后门攻击成功率的目标。