Advances in photo editing and manipulation tools have made it significantly easier to create fake imagery. Learning to detect such manipulations, however, remains a challenging problem due to the lack of sufficient amounts of manipulated training data. In this paper, we propose a learning algorithm for detecting visual image manipulations that is trained only using a large dataset of real photographs. The algorithm uses the automatically recorded photo EXIF metadata as supervisory signal for training a model to determine whether an image is self-consistent -- that is, whether its content could have been produced by a single imaging pipeline. We apply this self-consistency model to the task of detecting and localizing image splices. The proposed method obtains state-of-the-art performance on several image forensics benchmarks, despite never seeing any manipulated images at training. That said, it is merely a step in the long quest for a truly general purpose visual forensics tool.
翻译:照片编辑和操作工具的进步使得制作假图像大为容易。 然而,由于缺乏足够的经过操纵的培训数据,学习检测这种操纵仍然是一个具有挑战性的问题。 在本文中,我们提议了一种用于检测视觉图像操纵的学习算法,这种算法仅使用大量真实照片数据集进行培训。算法使用自动记录的图片 EXIF元数据作为监督信号,用于培训一个模型,以确定图像是否自成一体 -- -- 也就是说,其内容是否由单一成像管道生成。我们用这种自定模式来探测和定位图像串联。拟议方法在几个图像法证基准上取得了最新业绩,尽管在培训时从未看到过任何经过操纵的图像。 也就是说,它只是长期追求真正通用的视觉法证工具的一个步骤。