In this paper, we address the problem of image splicing localization with a multi-stream network architecture that processes the raw RGB image in parallel with other handcrafted forensic signals. Unlike previous methods that either use only the RGB images or stack several signals in a channel-wise manner, we propose an encoder-decoder architecture that consists of multiple encoder streams. Each stream is fed with either the tampered image or handcrafted signals and processes them separately to capture relevant information from each one independently. Finally, the extracted features from the multiple streams are fused in the bottleneck of the architecture and propagated to the decoder network that generates the output localization map. We experiment with two handcrafted algorithms, i.e., DCT and Splicebuster. Our proposed approach is benchmarked on three public forensics datasets, demonstrating competitive performance against several competing methods and achieving state-of-the-art results, e.g., 0.898 AUC on CASIA.
翻译:在本文中,我们用一个多流网络结构来解决图像拼接本地化问题,该结构将原始 RGB 图像与其他手工制作的法医信号平行处理。与以往的方法不同,我们建议使用一种只使用 RGB 图像或以频道方式堆放若干信号的编码器解码器结构,由多个编码器流组成。每条流都以篡改的图像或手工制作的信号为原料,并单独处理它们,以便从每个流中获取相关信息。最后,从多流中提取的特征被结合到结构的瓶颈中,并传播到生成输出本地化图的解码网络。我们用两种手工制作的算法,即DCT 和 Splicebuster 进行实验。我们提议的方法以三种公共法解码数据集为基准,展示与几种竞合方法的竞争性性表现,并在CSIA上取得最新的结果,例如,0.898 AUC。