We consider the communication complexity of the Hamming distance of two strings. Bille et al. [SPIRE 2018] considered the communication complexity of the longest common prefix (LCP) problem in the setting where the two parties have their strings in a compressed form, i.e., represented by the Lempel-Ziv 77 factorization (LZ77) with/without self-references. We present a randomized public-coin protocol for a joint computation of the Hamming distance of two strings represented by LZ77 without self-references. While our scheme is heavily based on Bille et al.'s LCP protocol, our complexity analysis is original which uses Crochemore's C-factorization and Rytter's AVL-grammar. As a byproduct, we also show that LZ77 with/without self-references are not monotonic in the sense that their sizes can increase by a factor of 4/3 when a prefix of the string is removed.
翻译:我们考虑了两个字符串的Hamming距离的通信复杂性。 Bile 等人[SPIRE 2018] 考虑了双方以压缩形式,即以Lempel-Ziv 77因子化(LZ77)为代表,且/无自我参照,在两个字符串的Hamming距离(LCP)的环境下,最长的共同前缀(LCP)问题的通信复杂性。我们提出了一个随机化的公币协议,用于联合计算LZ77所代表两个字符串的Hamming距离,而没有自我参照。虽然我们的计划在很大程度上基于Bile et al的LCP协议,但我们的复杂度分析是原始的,使用Crochemore的C-因子化和Rytter的AVL-grammar。作为一个副产品,我们还表明,在删除字符串的前缀时,其大小可能增加4/3的系数。