Audio steganography aims at concealing secret information in carrier audio with imperceptible modification on the carrier. Although previous works addressed the robustness of concealed message recovery against distortions introduced during transmission, they do not address the robustness against aggressive editing such as mixing of other audio sources and source separation. In this work, we propose for the first time a steganography method that can embed information into individual sound sources in a mixture such as instrumental tracks in music. To this end, we propose a time-domain model and curriculum learning essential to learn to decode the concealed message from the separated sources. Experimental results show that the proposed method successfully conceals the information in an imperceptible perturbation and that the information can be correctly recovered even after mixing of other sources and separation by a source separation algorithm. Furthermore, we show that the proposed method can be applied to multiple sources simultaneously without interfering with the decoder for other sources even after the sources are mixed and separated.
翻译:音频剖析的目的是在承运人的音频中隐藏秘密信息,对承运人进行无法察觉的修改。虽然以前的工作解决了在传输过程中针对扭曲现象进行隐蔽信息回收的稳健性,但并没有解决对积极编辑的稳健性,例如其他音频来源的混合和源的分离。在这项工作中,我们首次提议了一种可将信息嵌入单个音源的静默性方法,例如音乐中的工具轨迹。为此,我们提议了一个时间域模型和课程学习学习,以学习解码从分离源解码隐藏的信息。实验结果表明,拟议的方法成功地将信息隐藏在不可察觉的扰动中,而且即使在将其他来源混合并用源分离算法分离之后,信息也可以正确收回。此外,我们表明,拟议的方法可以同时适用于多个源,而不会干扰其他来源的解码器。