The key challenge of image manipulation detection is how to learn generalizable features that are sensitive to manipulations in novel data, whilst specific to prevent false alarms on authentic images. Current research emphasizes the sensitivity, with the specificity overlooked. In this paper we address both aspects by multi-view feature learning and multi-scale supervision. By exploiting noise distribution and boundary artifact surrounding tampered regions, the former aims to learn semantic-agnostic and thus more generalizable features. The latter allows us to learn from authentic images which are nontrivial to be taken into account by current semantic segmentation network based methods. Our thoughts are realized by a new network which we term MVSS-Net. Extensive experiments on five benchmark sets justify the viability of MVSS-Net for both pixel-level and image-level manipulation detection.
翻译:图像操纵探测的关键挑战是如何学习对新数据操纵敏感的通用特征,而具体地说则是防止真实图像上的虚假警报。当前研究强调敏感度,忽略了特殊性。在本文件中,我们通过多视图特征学习和多尺度监督来处理这两个方面。前者利用被篡改区域周围的噪音分布和边界工艺品,目的是学习语义学-神学特征,从而更加普遍化。后者使我们能够从真实图像中学习目前基于语义分割网络的方法所考虑的非技术性图像。我们的想法是通过我们称为MVSS-Net的新网络实现的。对五套基准集的广泛实验证明MVSS-Net在像素水平和图像水平操纵检测方面的可行性。