The Longest Common Subsequence (LCS) of two strings is a fundamental string similarity measure with a classical dynamic programming solution taking quadratic time. Despite significant efforts, little progress was made in improving the runtime. Even in the realm of approximation, not much was known for linear time algorithms beyond the trivial $\sqrt{n}$-approximation. Recent breakthrough result provided a $n^{0.497}$-factor approximation algorithm [HSSS19], which was more recently improved to a $n^{0.4}$-factor one [BCD21]. The latter paper also showed a $n^{2-2.5\alpha}$ time algorithm which outputs a $n^{\alpha}$ approximation to the LCS, but so far no sub-polynomial approximation is known in truly subquadratic time. In this work, we show an algorithm which runs in $O(n)$ time, and outputs a $n^{o(1)}$-factor approximation to LCS$(x,y)$, with high probability, for any pair of length $n$ input strings. Our entire algorithm is merely an efficient black-box reduction to the Block-LIS problem, introduced very recently in [ANSS21], and solving the Block-LIS problem directly.
翻译:两个字符串中最长期常见的子序列(LCS)是一条基本链条的相似度,一个典型的动态编程解决方案正在四进制时间。尽管做出了重大努力,但在改进运行时间方面进展甚微。即使在近似领域,除了小的 $@sqrt{n}$-occol admination(LCS19) 外,线性时间算法也鲜为人知。最近的突破结果提供了一种以美元(n)美元计价的算法[HSS19],该算法最近改进为1美元(BCD21)至1美元(BCD21)。后一份文件还显示了一个美元($2-2.5\alpha})的时间算法,该算法产生美元接近LCS的近似值,但迄今为止,在真正次偏差的时段里,还不知道什么亚极近的近的线性算法。我们展示了一种以美元(n)时间运行的算法,而产出以美元(x)$(y)至21美元(x)美元(x)美)的近似的近似值美元。我们的算法在任何一对美元输入链中,只是将一个有效的降低问题压解到直接解决整个。