With recent advances in machine learning, researchers are now able to solve traditional problems with new solutions. In the area of digital watermarking, deep-learning-based watermarking technique is being extensively studied. Most existing approaches adopt a similar encoder-driven scheme which we name END (Encoder-NoiseLayer-Decoder) architecture. In this paper, we revamp the architecture and creatively design a decoder-driven watermarking network dubbed De-END which greatly outperforms the existing END-based methods. The motivation for designing De-END originated from the potential drawback we discovered in END architecture: The encoder may embed redundant features that are not necessary for decoding, limiting the performance of the whole network. We conducted a detailed analysis and found that such limitations are caused by unsatisfactory coupling between the encoder and decoder in END. De-END addresses such drawbacks by adopting a Decoder-Encoder-Noiselayer-Decoder architecture. In De-END, the host image is firstly processed by the decoder to generate a latent feature map instead of being directly fed into the encoder. This latent feature map is concatenated to the original watermark message and then processed by the encoder. This change in design is crucial as it makes the feature of encoder and decoder directly shared thus the encoder and decoder are better coupled. We conducted extensive experiments and the results show that this framework outperforms the existing state-of-the-art (SOTA) END-based deep learning watermarking both in visual quality and robustness. On the premise of the same decoder structure, the visual quality (measured by PSNR) of De-END improves by 1.6dB (45.16dB to 46.84dB), and extraction accuracy after JPEG compression (QF=50) distortion outperforms more than 4% (94.9% to 99.1%).
翻译:随着机器学习的最新进展,研究人员现在能够用新的解决方案解决传统问题。在数字水印领域,正在广泛研究基于深学习的水印技术。大多数现有方法都采用了类似的编码器驱动方案,我们命名为 END( Encoder-Noise Layer-Decoder) 架构。在本文中,我们改造了结构,创造性地设计了一个代号为Decoder驱动的水印网络,大大超越了现有的基于 END 的变异方法。设计De-end的动机源于我们在 END 架构中发现的潜在缺陷:编码器可能嵌入一个多余的功能,而这种功能对于解码来说并不必要,限制了整个网络的性能。我们进行了详细分析,发现这种局限性是由于在END(E)和解码驱动器的解码驱动器之间发生不令人满意的合并。De-ender-ender-Nationer-Decoder 架构大大超越了已有的变异性。在Deender-Deender 结构中,主机图像首先由Deco 解算器进行处理,然后由这个直译器将这个直态的直译结果转换成,然后通过直置的解到直置的 Rde-dededededededededededeal 和直译的 Rdemodeal 将这个原始的图制成成为该元的图,而成为了这个直成。