In this paper we present TruFor, a forensic framework that can be applied to a large variety of image manipulation methods, from classic cheapfakes to more recent manipulations based on deep learning. We rely on the extraction of both high-level and low-level traces through a transformer-based fusion architecture that combines the RGB image and a learned noise-sensitive fingerprint. The latter learns to embed the artifacts related to the camera internal and external processing by training only on real data in a self-supervised manner. Forgeries are detected as deviations from the expected regular pattern that characterizes each pristine image. Looking for anomalies makes the approach able to robustly detect a variety of local manipulations, ensuring generalization. In addition to a pixel-level localization map and a whole-image integrity score, our approach outputs a reliability map that highlights areas where localization predictions may be error-prone. This is particularly important in forensic applications in order to reduce false alarms and allow for a large scale analysis. Extensive experiments on several datasets show that our method is able to reliably detect and localize both cheapfakes and deepfakes manipulations outperforming state-of-the-art works. Code will be publicly available at https://grip-unina.github.io/TruFor/
翻译:在本文中,我们介绍TruFor, 这个法证框架可以适用于从经典廉价假冒到基于深层次学习的更近的操纵等多种图像操纵方法。 我们依赖通过基于变压器的聚合结构提取高层次和低层次的痕迹,该结构将 RGB 图像与一个对噪音敏感的智能指纹结合起来。 后者学会只通过以自我监督方式对真实数据进行培训,嵌入与相机内部和外部处理有关的艺术品。 伪造被检测为偏离了每个原始图像的预期常规模式。 查找异常现象使方法能够强有力地探测各种本地操纵,确保概括化。 除了基于变压器的本地化图和整体图像完整性评分外,我们的方法还产生一个可靠地图,其中突出地方化预测可能容易出错的领域。 这在法医应用中特别重要,以减少错误的警报,并允许进行大规模分析。 有关若干数据集的广泛实验显示,我们的方法能够可靠地探测和本地化各种本地操作,确保一般化。 廉价的代码/深层次的操控将进行。