As deep image forgery powered by AI generative models, such as GANs, continues to challenge today's digital world, detecting AI-generated forgeries has become a vital security topic. Generalizability and robustness are two critical concerns of a forgery detector, determining its reliability when facing unknown GANs and noisy samples in an open world. Although many studies focus on improving these two properties, the root causes of these problems have not been fully explored, and it is unclear if there is a connection between them. Moreover, despite recent achievements in addressing these issues from image forensic or anti-forensic aspects, a universal method that can contribute to both sides simultaneously remains practically significant yet unavailable. In this paper, we provide a fundamental explanation of these problems from a frequency perspective. Our analysis reveals that the frequency bias of a DNN forgery detector is a possible cause of generalization and robustness issues. Based on this finding, we propose a two-step frequency alignment method to remove the frequency discrepancy between real and fake images, offering double-sided benefits: it can serve as a strong black-box attack against forgery detectors in the anti-forensic context or, conversely, as a universal defense to improve detector reliability in the forensic context. We also develop corresponding attack and defense implementations and demonstrate their effectiveness, as well as the effect of the frequency alignment method, in various experimental settings involving twelve detectors, eight forgery models, and five metrics.
翻译:随着以GANs等AI生成模型驱动的深度图像伪造技术持续挑战当今数字世界,检测AI生成的伪造内容已成为至关重要的安全议题。泛化性与鲁棒性是伪造检测器的两个关键考量,决定了其在开放世界中面对未知GAN模型和含噪样本时的可靠性。尽管许多研究致力于提升这两项性能,但这些问题产生的根本原因尚未被充分探究,且二者之间是否存在关联尚不明确。此外,尽管近期在图像取证或反取证方面解决这些问题取得了进展,但能同时提升两方面性能的通用方法仍具有重要现实意义却尚未实现。本文从频率视角为这些问题提供了根本性解释。我们的分析表明,DNN伪造检测器的频率偏差可能是导致泛化与鲁棒性问题的原因。基于此发现,我们提出了一种两步频率对齐方法以消除真实图像与伪造图像间的频率差异,该方法具有双重效益:在反取证场景中可作为针对伪造检测器的强效黑盒攻击手段;反之,在取证场景中可作为提升检测器可靠性的通用防御策略。我们还开发了相应的攻击与防御实施方案,并在涉及十二种检测器、八种伪造模型及五项评估指标的多种实验设置中,验证了其有效性以及频率对齐方法的作用。