从子字符串构成进行字符串重建的新代数法 (A New Algebraic Approach for String Reconstruction from Substring Compositions)

We consider the problem of binary string reconstruction from the multiset of its substring compositions, i.e., referred to as the substring composition multiset, first introduced and studied by Acharya et al. We introduce a new algorithm for the problem of string reconstruction from its substring composition multiset which relies on the algebraic properties of the equivalent bivariate polynomial formulation of the problem. We then characterize specific algebraic conditions for the binary string to be reconstructed that guarantee the algorithm does not require any backtracking through the reconstruction, and, consequently, the time complexity is bounded polynomially. More specifically, in the case of no backtracking, our algorithm has a time complexity of $O(n^2)$ compared to the algorithm by Acharya et al., which has a time complexity of $O(n^2\log(n))$, where $n$ is the length of the binary string. Furthermore, it is shown that larger sets of binary strings are uniquely reconstructable by the new algorithm and without the need for backtracking leading to codebooks of reconstruction codes that are larger, by a linear factor in size, compared to the previously known construction by Pattabiraman et al., while having $O(n^2)$ reconstruction complexity.

翻译：我们从子字符串的多层结构中考虑二进制重整的问题,即所谓的子字符串的多层结构,首先由Acharya等人介绍和研究。我们从子字符串的多层结构中引入了一种新的算法,从它的子字符串的多层结构中引入了一个新的算法。我们引入了一种新的算法,解决弦线的重建问题,它依赖于对等的双进多义多语系配方的代数特性。然后我们给要重建的二进制字符串确定具体的代数条件,保证算法不需要在重建过程中进行任何回溯跟踪,因此,时间的复杂性是多式的。更具体地说,在没有回溯跟踪的情况下,我们的算法与Acharya等人的算法相比,其时间复杂性为$O(n)2\log(n),其中美元是二进制字符串的长度。此外,还表明,更大的二进制字符串的比对后行法是独特的,不需要导致后行的美元,而前一进法则以已知的里程的里程来比较。