Steganography is the practice of encoding secret information into innocuous content in such a manner that an adversarial third party would not realize that there is hidden meaning. While this problem has classically been studied in security literature, recent advances in generative models have led to a shared interest among security and machine learning researchers in developing scalable steganography techniques. In this work, we show that a steganography procedure is perfectly secure under Cachin (1998)'s information theoretic-model of steganography if and only if it is induced by a coupling. Furthermore, we show that, among perfectly secure procedures, a procedure is maximally efficient if and only if it is induced by a minimum entropy coupling. These insights yield what are, to the best of our knowledge, the first steganography algorithms to achieve perfect security guarantees with non-trivial efficiency; additionally, these algorithms are highly scalable. To provide empirical validation, we compare a minimum entropy coupling-based approach to three modern baselines -- arithmetic coding, Meteor, and adaptive dynamic grouping -- using GPT-2, WaveRNN, and Image Transformer as communication channels. We find that the minimum entropy coupling-based approach achieves superior encoding efficiency, despite its stronger security constraints. In aggregate, these results suggest that it may be natural to view information-theoretic steganography through the lens of minimum entropy coupling.
翻译:隐写术是将机密信息编码到无害内容中的做法,以至于第三方对手无法察觉到存在隐藏的意义。虽然这个问题经典地被研究在安全论文中,但是生成模型的最新进展使得安全和机器学习研究者之间共同关注开发可扩展的隐写术技术。在这项工作中,我们证明了如果隐写程式是通过耦合所引导产生的,那么根据 Cachin(1998)的信息熵理论模型,隐写程式是绝对安全的。此外,我们还证明,除去绝对安全的程序之外,在绝对安全的程序中,如果是通过最小熵耦合所引导产生,那么它最大效率。这些见解得出了令人称道的成果:据最好的我们所知,这是第一个具有非平凡效率的同时又能够达到绝对安全保证的隐写术算法。此外,这些算法是高度可扩展的。为了提供实证验证,我们将最小熵耦合方法与三种现代基线算法--算术编码、Meteor 和自适应动态分组--在 GPT-2、WaveRNN 和 ImageTransformer 通讯渠道使用下进行了比较。我们发现,尽管有更强的安全限制,最小熵耦合法在编码效率方面得到了优越的结果。总的来说,这些结果表明,可能自然地将信息理论隐写术视作是通过最小熵耦合而实现的。