The Quantitative Group Testing (QGT) is about learning a (hidden) subset K of some large domain N using the shortest-possible sequence of queries, where a result of a query provides some information about the size of the intersection of the query with the unknown subset K. Due to very large size of N the length of the sequence of queries proportional to |N| is not acceptable. Thus, the goal of QGT is to extract the knowledge about K using the number of queries that only logarithmically depends on |N| and polynomially on |K|. The QGT framework has already proven its applicability to extracting knowledge from large streams of data, such as finding hot elements or histograms, and to identifying infected records using a small number of tests. We also show other applications to extracting elements from long data streams, maintaining dynamically changing graphs and hypergraphs and private parallel Information Retrieval. Almost all previous work focused on randomized algorithms, which in case of large datasets or streams may have a significant deviation from the expected precision. To mitigate this effect and, instead, to seek a solution for any dataset/stream, in this work we propose an efficient non-adaptive deterministic QGT algorithms for constructing queries and deconstructing hidden set K from the results of the queries. The efficiency is three-fold. First, in terms of almost-optimal number of queries and memory to produce them. Second, all algorithms work in polynomial time. Third, they work for any hidden set K, as well as multi-sets, and even if the results of the queries are capped at $\sqrt{|K|}$.
翻译:定量组测试( QGT) 是要使用最短的查询序列来学习某个大域 N 的( 隐藏) 子 K 。 查询的结果为您提供了一些关于查询与未知子子 K 交叉点的大小的信息。 由于查询与 {\\\\\\\\\\\\\\\\\\\\\\\\QQQ, 无法被接受。 因此, QGT 的目标是利用查询数量来获取 K 的知识, 询问数量只能对数取决于 {N ⁇ 和 {K} 。 QGT 框架已经证明了它对于从大量数据流中提取知识( 如查找热元素或直方图), 以及用少量的测试来识别被感染的记录。 我们还展示了从长数据流中提取元素的其他应用程序, 保持动态变化的图表和超直线图以及私人平行的信息Retrieval。 几乎所有先前的工作都集中在随机算法, 以大数据集或流为例, 可能大大偏离预期的精确值 。 。 QGT 框架框架框架框架框架已经证明了从大量 数据流中获取了从大量 。 。 QQQQQQQL 的解算法, 的解算 。