With the development of computer graphics technology, the images synthesized by computer software become more and more closer to the photographs. While computer graphics technology brings us a grand visual feast in the field of games and movies, it may also be utilized by someone with bad intentions to guide public opinions and cause political crisis or social unrest. Therefore, how to distinguish the computer-generated graphics (CG) from the photographs (PG) has become an important topic in the field of digital image forensics. This paper proposes a dual stream convolutional neural network based on channel joint and softpool. The proposed network architecture includes a residual module for extracting image noise information and a joint channel information extraction module for capturing the shallow semantic information of image. In addition, we also design a residual structure to enhance feature extraction and reduce the loss of information in residual flow. The joint channel information extraction module can obtain the shallow semantic information of the input image which can be used as the information supplement block of the residual module. The whole network uses SoftPool to reduce the information loss of down-sampling for image. Finally, we fuse the two flows to get the classification results. Experiments on SPL2018 and DsTok show that the proposed method outperforms existing methods, especially on the DsTok dataset. For example, the performance of our model surpasses the state-of-the-art by a large margin of 3%.
翻译:随着计算机图形技术的发展,计算机软件合成的图像越来越接近照片。虽然计算机图形技术给我们带来了游戏和电影领域的大型视觉盛宴,但也可能被那些意图不良的人用来引导公众意见并造成政治危机或社会动乱。因此,如何将计算机生成的图形(CG)与照片(PG)区别开来已成为数字图像法证领域的一个重要主题。本文件提议在频道联合和软资源库的基础上建立一个双流神经神经网络。拟议的网络结构包括一个提取图像噪音信息的剩余模块和一个收集浅浅图像语义信息的联合频道信息提取模块。此外,我们还设计了一个剩余结构,以加强特征提取并减少信息在剩余流中丢失的信息。联合频道信息提取模块可以获取作为剩余模块信息补充块的输入图像(PG)的浅层语义信息。整个网络使用SoftPool来减少图像下标的信息损失。最后,我们将两种流动整合为获取图像的浅色语义信息提取模块,特别是Dexiorates的运行方式。在现有的 SPL18 和 Dexivers 上展示了现有数据上的大型模型。