Backdoor attacks have been demonstrated as a security threat for machine learning models. Traditional backdoor attacks intend to inject backdoor functionality into the model such that the backdoored model will perform abnormally on inputs with predefined backdoor triggers and still retain state-of-the-art performance on the clean inputs. While there are already some works on backdoor attacks on Graph Neural Networks (GNNs), the backdoor trigger in the graph domain is mostly injected into random positions of the sample. There is no work analyzing and explaining the backdoor attack performance when injecting triggers into the most important or least important area in the sample, which we refer to as trigger-injecting strategies MIAS and LIAS, respectively. Our results show that, generally, LIAS performs better, and the differences between the LIAS and MIAS performance can be significant. Furthermore, we explain these two strategies' similar (better) attack performance through explanation techniques, which results in a further understanding of backdoor attacks in GNNs.
翻译:后门攻击已被证明对机器学习模型构成安全威胁。传统后门攻击旨在将后门功能注入到模型中,使得带有预定义后门触发器的输入在其中表现异常,而在干净输入上保持最先进的性能。虽然已经有一些关于图神经网络(GNN)上的后门攻击工作,但是在图域中,后门触发器大多注入到样本的随机位置。没有工作分析和解释在样本的最重要或最不重要区域注入触发器的后门攻击性能,我们称之为触发器注入策略 MIAS 和 LIAS。我们的结果表明,通常情况下,LIAS 的表现更好,并且 LIAS 和 MIAS 之间的差异可能非常显著。此外,我们通过解释技术解释了这两种策略类似(更好)的攻击性能,从而进一步了解了 GNN 中的后门攻击。