We revisit the well-studied problem of estimating the Shannon entropy of a probability distribution, now given access to a probability-revealing conditional sampling oracle. In this model, the oracle takes as input the representation of a set $S$ and returns a sample from the distribution obtained by conditioning on $S$, together with the probability of that sample in the distribution. Our work is motivated by applications of such algorithms in Quantitative Information Flow analysis (QIF) in programming-language-based security. Here, information-theoretic quantities capture the effort required on the part of an adversary to obtain access to confidential information. These applications demand accurate measurements when the entropy is small. Existing algorithms that do not use conditional samples require a number of queries that scale inversely with the entropy, which is unacceptable in this regime, and indeed, a lower bound by Batu et al.(STOC 2002) established that no algorithm using only sampling and evaluation oracles can obtain acceptable performance. On the other hand, prior work in the conditional sampling model by Chakraborty et al.(SICOMP 2016) only obtained a high-order polynomial query complexity, $\mathcal{O}(\frac{m^7}{\epsilon^8}\log\frac{1}{\delta})$ queries, to obtain additive $\epsilon$-approximations on a domain of size $\mathcal{O}(2^m)$. We obtain multiplicative $(1+\epsilon)$-approximations using only $\mathcal{O}(\frac{m}{\epsilon^2}\log\frac{1}{\delta})$ queries to the probability-revealing conditional sampling oracle. Indeed, moreover, we obtain small, explicit constants, and demonstrate that our algorithm obtains a substantial improvement in practice over the previous state-of-the-art methods used for entropy estimation in QIF.
 翻译:我们重新审视了估算概率分布的香农( QIF) 问题, 现在给予访问概率翻现的有条件取样或触摸的机会 。 在这个模型中, 甲骨文将一组S$的表示作为输入, 并返回通过调制美元获得的分配样本的样本, 以及分布中该样本的概率。 我们的工作受定量信息流程分析( QIF) 中应用这种算法的驱动。 这里, 信息- 理论数量可以捕捉对手获取机密信息所需的努力。 这些应用程序需要精确测量 。 不使用有条件样本的现有算法需要一定数量的查询, 而此系统无法接受, 事实上, Batu etal 和 al. (STOC 2002) 确定, 在基于编程的基于语言的安全中, 任何仅使用取样和评估的算法都无法取得可接受的性能。 在另一边, 查克拉博特( COMP 2016) 的有条件采样模型中, 仅获取了使用高频程程程的解算法 。