In group testing, the goal is to identify a subset of defective items within a larger set of items based on tests whose outcomes indicate whether at least one defective item is present. This problem is relevant in areas such as medical testing, DNA sequencing, communication protocols, and many more. In this paper, we study (i) a sparsity-constrained version of the problem, in which the testing procedure is subjected to one of the following two constraints: items are finitely divisible and thus may participate in at most $\gamma$ tests; or tests are size-constrained to pool no more than $\rho$ items per test; and (ii) a noisy version of the problem, where each test outcome is independently flipped with some constant probability. Under each of these settings, considering the for-each recovery guarantee with asymptotically vanishing error probability, we introduce a fast splitting algorithm and establish its near-optimality not only in terms of the number of tests, but also in terms of the decoding time. While the most basic formulations of our algorithms require $\Omega(n)$ storage for each algorithm, we also provide low-storage variants based on hashing, with similar recovery guarantees.
翻译:在群件测试中,目标是在测试结果显示至少有一个有缺陷的项目存在的情况下,根据测试结果显示至少存在一个有缺陷的项目,在更大系列的物品中确定一组有缺陷的物品。这个问题在医学测试、DNA测序、通信协议和许多其他领域都有关联性。在本文中,我们研究(一) 问题的宽度限制版本,测试程序受到以下两个限制之一的制约:项目有一定的可分解性,因此可以最多参加$\gamma$的测试;或测试因体积限制而不能将每个测试的物品集中起来不超过$/rho$;以及(二) 问题的噪音版本,其中每个测试结果都以某种恒定概率独立翻转。在每种情况下,我们考虑到以尽可能消逝的误差概率来保证每个恢复,我们采用快速分解算法,并不仅在测试数量上,而且在解码时间上确定其近乎最佳性。虽然我们算法的最基本配方需要$\美元(n)的存储方式,但每个变式都以类似的方式提供。