重建噪音集合数据 (Distributed Reconstruction of Noisy Pooled Data)

In the pooled data problem we are given a set of $n$ agents, each of which holds a hidden state bit, either $0$ or $1$. A querying procedure returns for a query set the sum of the states of the queried agents. The goal is to reconstruct the states using as few queries as possible. In this paper we consider two noise models for the pooled data problem. In the noisy channel model, the result for each agent flips with a certain probability. In the noisy query model, each query result is subject to random Gaussian noise. Our results are twofold. First, we present and analyze for both error models a simple and efficient distributed algorithm that reconstructs the initial states in a greedy fashion. Our novel analysis pins down the range of error probabilities and distributions for which our algorithm reconstructs the exact initial states with high probability. Secondly, we present simulation results of our algorithm and compare its performance with approximate message passing (AMP) algorithms that are conjectured to be optimal in a number of related problems.

翻译：在集合数据问题中,我们得到了一组美元代理,每个代理都持有隐藏状态位数,或0美元或1美元。查询程序返回一个查询程序,以设置被查询代理数的总和。目标是尽可能少地使用查询来重建各州。在本文中,我们考虑对集合数据问题使用两种噪音模型。在吵闹的频道模型中,每个代理数的结果有一定的概率翻转。在吵闹的查询模型中,每个查询结果都受随机高斯语噪音的影响。我们的结果是双重的。首先,我们为两个错误模型提出并分析一个简单而高效的分布算法,以贪婪的方式重建最初的州。我们的新分析将错误概率和分布的范围缩小下来,而我们的算法则极有可能重建准确的初始状态。第二,我们提出我们的算法的模拟结果,并将其性能与近似信息传递的算法进行比较,这些算出在一些相关问题上是最佳的。