The Invertible Bloom Lookup Table (IBLT) is a probabilistic concise data structure for set representation that supports a listing operation as the recovery of the elements in the represented set. Its applications can be found in network synchronization and traffic monitoring as well as in error-correction codes. IBLT can list its elements with probability affected by the size of the allocated memory and the size of the represented set, such that it can fail with small probability even for relatively small sets. While previous works only studied the failure probability of IBLT, this work initiates the worst case analysis of IBLT that guarantees successful listing for all sets of a certain size. The worst case study is important since the failure of IBLT imposes high overhead. We describe a novel approach that guarantees successful listing when the set satisfies a tunable upper bound on its size. To allow that, we develop multiple constructions that are based on various coding techniques such as stopping sets and the stopping redundancy of error-correcting codes, Steiner systems, and covering arrays as well as new methodologies we develop. We analyze the sizes of IBLTs with listing guarantees obtained by the various methods as well as their mapping memory consumption. Lastly, we study lower bounds on the achievable sizes of IBLT with listing guarantees and verify the results in the paper by simulations.
翻译:不可逆的 Bloom Lookup Table (IBLT) 是一个用于集成代表的概率简洁的数据结构, 用于支持列表操作, 作为代表集中元素的恢复。 其应用程序可以在网络同步、 交通监测以及错误校正代码中找到。 IBLT 可以列出其元素, 其概率受分配内存大小和代表集大小的影响, 这样,即使对于相对小的数据集, 它也有可能很小地失败。 以前的工作只研究了 IBLT 的失灵概率, 这项工作启动了IBLT 最差的个案分析, 保证成功列出所有各套特定尺寸的数据集。 最差的案例研究很重要, 因为 IBLT 的失败导致高管理管理率。 我们描述了一种新办法, 保证在集集符合所分配内存内存内存内存大小上限时能够成功列出。 为了允许这一点, 我们开发多种构造, 以各种编码技术为基础, 如停止设置和停止错误校正代码的冗余, 施泰纳系统, 覆盖阵列以及我们开发的新方法。 我们分析了IBLT的大小,, 列出各种方法获得的保证, 的大小, 以及以可实现ILTLT 模 格式的校验的校验结果。