Copy-move forgery is a manipulation of copying and pasting specific patches from and to an image, with potentially illegal or unethical uses. Recent advances in the forensic methods for copy-move forgery have shown increasing success in detection accuracy and robustness. However, for images with high self-similarity or strong signal corruption, the existing algorithms often exhibit inefficient processes and unreliable results. This is mainly due to the inherent semantic gap between low-level visual representation and high-level semantic concept. In this paper, we present a very first study of trying to mitigate the semantic gap problem in copy-move forgery detection, with spatial pooling of local moment invariants for midlevel image representation. Our detection method expands the traditional works on two aspects: 1) we introduce the bag-of-visual-words model into this field for the first time, may meaning a new perspective of forensic study; 2) we propose a word-to-phrase feature description and matching pipeline, covering the spatial structure and visual saliency information of digital images. Extensive experimental results show the superior performance of our framework over state-of-the-art algorithms in overcoming the related problems caused by the semantic gap.
翻译:复制式伪造是复制和粘贴特定图象的操纵,可能是非法的或不道德的用途。最近复制式伪造法法学方法的进步显示,在探测准确性和稳健性方面越来越成功。然而,对于自我高度相似或信号严重腐败的图像,现有的算法往往显示效率低下的过程和不可靠的结果。这主要是由于低水平视觉表现和高层次语义概念之间固有的语义差异。在本文中,我们提出了试图减少复制式移动伪造探测中的语义差距问题的首项研究,同时将局部变异性时段空间集中用于中层图像代表。我们的检测方法扩大了传统工作在两个方面:(1) 我们首次将视觉词包模型引入这一领域,可能意味着对法医研究的新观点;(2) 我们提出了一个字对字的特征描述和匹配管道,涵盖数字图像的空间结构和视觉特征信息。广泛的实验结果显示我们框架在克服由磁性差距造成的相关问题方面优于状态艺术算法。