We study the information leakage to a guessing adversary in zero-error source coding. The source coding problem is defined by a confusion graph capturing the distinguishability between source symbols. The information leakage is measured by the ratio of the adversary's successful guessing probability after and before eavesdropping the codeword, maximized over all possible source distributions. Such measurement under the basic adversarial model where the adversary makes a single guess and allows no distortion between its estimator and the true sequence is known as the maximum min-entropy leakage or the maximal leakage in the literature. We develop a single-letter characterization of the optimal normalized leakage under the basic adversarial model, together with an optimum-achieving scalar stochastic mapping scheme. An interesting observation is that the optimal normalized leakage is equal to the optimal compression rate with fixed-length source codes, both of which can be simultaneously achieved by some deterministic coding schemes. We then extend the leakage measurement to generalized adversarial models where the adversary makes multiple guesses and allows certain level of distortion, for which we derive single-letter lower and upper bounds.
翻译:我们研究在零危险源编码中向猜想对手泄漏的信息。 源代码问题的定义是用一个模糊的图解来区分源符号的区别。 信息渗漏的衡量方法是,在窃听代码字之后和之前,对手成功猜测概率的比例; 最大限度地覆盖所有可能的源分布。 基本对称模型下的测量方法, 对手只猜测一次, 不允许其估计值与真实序列之间发生扭曲, 被称为最大最小孔径渗漏或文献中的最大渗漏。 我们对基本对称模型下的最佳常态渗漏进行了单字母定性, 并制定了最佳实现的标度缩放绘图方法。 一个有趣的观察是, 最佳常态渗漏与使用固定长度源代码的最佳压缩率相等, 这两种方法都可以由某些确定性调试方法同时实现。 然后将渗漏测量方法扩大到一般的对称模型, 敌人进行多重猜测, 允许某种程度的扭曲, 我们从中得出单字母的下层和上层。