With the ongoing popularization of online services, the digital document images have been used in various applications. Meanwhile, there have emerged some deep learning-based text editing algorithms which alter the textual information of an image . In this work, we present a document forgery algorithm to edit practical document images. To achieve this goal, the limitations of existing text editing algorithms towards complicated characters and complex background are addressed by a set of network design strategies. First, the unnecessary confusion in the supervision data is avoided by disentangling the textual and background information in the source images. Second, to capture the structure of some complicated components, the text skeleton is provided as auxiliary information and the continuity in texture is considered explicitly in the loss function. Third, the forgery traces induced by the text editing operation are mitigated by some post-processing operations which consider the distortions from the print-and-scan channel. Quantitative comparisons of the proposed method and the exiting approach have shown the advantages of our design by reducing the about 2/3 reconstruction error measured in MSE, improving reconstruction quality measured in PSNR and in SSIM by 4 dB and 0.21, respectively. Qualitative experiments have confirmed that the reconstruction results of the proposed method are visually better than the existing approach. More importantly, we have demonstrated the performance of the proposed document forgery algorithm under a practical scenario where an attacker is able to alter the textual information in an identity document using only one sample in the target domain. The forged-and-recaptured samples created by the proposed text editing attack and recapturing operation have successfully fooled some existing document authentication systems.
翻译:随着在线服务的不断普及,数字文件图像被应用于各种应用中。与此同时,出现了一些深层次的基于学习的文本编辑算法,改变了图像的文字信息。在这项工作中,我们提出了一个用于编辑实用文件图像的文件伪造算法。为实现这一目标,一套网络设计战略解决了现有文本编辑算法对复杂字符和复杂背景的局限性。首先,通过在源图像中分离文本和背景信息,避免了监督数据中的不必要混乱。第二,为了捕捉一些复杂的组成部分的结构,提供了文本骨架,作为辅助信息,在丢失功能中明确考虑纹理的连续性。第三,文本编辑操作引发的伪造痕迹通过一些后处理操作得到减轻,这些操作考虑到印刷和扫描渠道的扭曲。对拟议方法和退出方法的定量比较表明我们的设计的优点,通过移动MSE中测得的大约2/3重建错误,提高PSNR和SSIM中的重建质量,4 dB 和0.21 文本中的重建质量。在损失功能功能函数分析中,一个比较性化文件的改进是现有文件的改进后期。在目前测试中,一个比较性文件的改进了现有文件的改进后期。