Audio watermarking is widely used for leaking source tracing. The robustness of the watermark determines the traceability of the algorithm. With the development of digital technology, audio re-recording (AR) has become an efficient and covert means to steal secrets. AR process could drastically destroy the watermark signal while preserving the original information. This puts forward a new requirement for audio watermarking at this stage, that is, to be robust to AR distortions. Unfortunately, none of the existing algorithms can effectively resist AR attacks due to the complexity of the AR process. To address this limitation, this paper proposes DeAR, a deep-learning-based audio re-recording resistant watermarking. Inspired by DNN-based image watermarking, we pioneer a deep learning framework for audio carriers, based on which the watermark signal can be effectively embedded and extracted. Meanwhile, in order to resist the AR attack, we delicately analyze the distortions that occurred in the AR process and design the corresponding distortion layer to cooperate with the proposed watermarking framework. Extensive experiments show that the proposed algorithm can resist not only common electronic channel distortions but also AR distortions. Under the premise of high-quality embedding (SNR=25.86dB), in the case of a common re-recording distance (20cm), the algorithm can effectively achieve an average bit recovery accuracy of 98.55%.
翻译:音频水印技术广泛用于泄露源跟踪。水印的鲁棒性决定了算法的可追溯性。随着数字技术的发展,音频重录(AR)已成为窃取机密信息的一种高效和隐蔽的手段。AR过程可能会彻底破坏水印信号,同时还保留原始信息。这为当前的音频水印技术提出了一项新的要求,即对AR失真具有鲁棒性。不幸的是,由于AR过程的复杂性,现有算法均无法有效抵抗AR攻击。为解决这一限制,本文提出了DeAR,一种基于深度学习的音频重录抗干扰水印技术。受基于DNN的图像水印技术启发,我们开创了一种音频载体的深度学习框架,基于此框架,可以有效地嵌入和提取水印信号。同时,为了抵御AR攻击,我们精细分析了AR过程中发生的失真,并设计了相应的失真层,以配合所提出的水印技术框架。大量实验表明,所提出的算法不仅可以抵抗常见的电子信道失真,还可以抵御AR失真。在高质量嵌入(SNR = 25.86dB)的前提下,在常见的重录距离(20cm)的情况下,该算法可以有效地实现平均比特恢复准确率为98.55%。