One common task in image forensics is to detect spliced images, where multiple source images are composed to one output image. Most of the currently best performing splicing detectors leverage high-frequency artifacts. However, after an image underwent strong compression, most of the high frequency artifacts are not available anymore. In this work, we explore an alternative approach to splicing detection, which is potentially better suited for images in-the-wild, subject to strong compression and downsampling. Our proposal is to model the color formation of an image. The color formation largely depends on variations at the scale of scene objects, and is hence much less dependent on high-frequency artifacts. We learn a deep metric space that is on one hand sensitive to illumination color and camera white-point estimation, but on the other hand insensitive to variations in object color. Large distances in the embedding space indicate that two image regions either stem from different scenes or different cameras. In our evaluation, we show that the proposed embedding space outperforms the state of the art on images that have been subject to strong compression and downsampling. We confirm in two further experiments the dual nature of the metric space, namely to both characterize the acquisition camera and the scene illuminant color. As such, this work resides at the intersection of physics-based and statistical forensics with benefits from both sides.
翻译:图像法证工作的一项共同任务是检测成像图象,其中多源图像组成一个输出图像。大多数目前最优秀的断层探测器都利用高频文物。然而,在图像被强烈压缩后,大多数高频文物不再可用。在这项工作中,我们探索了另一种方法来探测图象,这种图象可能更适合在瞬间拍摄图像,但需经过强烈压缩和下层取样。我们的建议是模拟图像的颜色形成。彩色形成主要取决于现场物体规模的变化,因此不那么依赖高频文物。我们学习了一种深度的测量空间,一方面对照明颜色和相机白点估计敏感,另一方面对物体颜色的变化不敏感。在嵌入空间的长距离表明两个图像区域不是来自不同的场景或不同的照相机。我们的建议是模拟图像的颜色形成。我们从两种深层次的测量空间的双重性质,即从这个摄像头到这个摄像头的双重性质。我们从两个摄像头的统计学角度都确认了这种性质。