The proliferation of fake news and its serious negative social influence push fake news detection methods to become necessary tools for web managers. Meanwhile, the multi-media nature of social media makes multi-modal fake news detection popular for its ability to capture more modal features than uni-modal detection methods. However, current literature on multi-modal detection is more likely to pursue the detection accuracy but ignore the robustness of the detector. To address this problem, we propose a comprehensive robustness evaluation of multi-modal fake news detectors. In this work, we simulate the attack methods of malicious users and developers, i.e., posting fake news and injecting backdoors. Specifically, we evaluate multi-modal detectors with five adversarial and two backdoor attack methods. Experiment results imply that: (1) The detection performance of the state-of-the-art detectors degrades significantly under adversarial attacks, even worse than general detectors; (2) Most multi-modal detectors are more vulnerable when subjected to attacks on visual modality than textual modality; (3) Popular events' images will cause significant degradation to the detectors when they are subjected to backdoor attacks; (4) The performance of these detectors under multi-modal attacks is worse than under uni-modal attacks; (5) Defensive methods will improve the robustness of the multi-modal detectors.
翻译:假新闻的扩散及其严重的负面社会影响促使假新闻探测方法成为网络管理者的必要工具。与此同时,社交媒体的多媒体性质使多式假新闻探测成为多式假新闻探测工具,因为它能够捕捉比单式探测方法更多的模式特征。然而,目前关于多式探测的文献更可能追求探测准确性,但忽视探测器的稳健性。为解决这一问题,我们提议对多式假新闻探测器进行全面的稳健性评估。在这项工作中,我们模拟恶意用户和开发者的攻击方法,即张贴假新闻和注射后门。具体地说,我们用五种对抗和两种后门攻击方法评价多式的假新闻探测器。实验结果表明:(1) 最先进的探测器在对抗性攻击下显著下降,甚至比一般探测器更差。(2) 多数多式探测器在受到视觉模式攻击时比文字模式攻击时更容易受到攻击;(3) 大众事件图像在受到后门攻击时会严重降解探测器;(4) 在多式攻击下进行强型传感器的改进。