LZ-End is a variant of the well-known Lempel-Ziv parsing family such that each phrase of the parsing has a previous occurrence, with the additional constraint that the previous occurrence must end at the end of a previous phrase. LZ-End was initially proposed as a greedy parsing, where each phrase is determined greedily from left to right, as the longest factor that satisfies the above constraint~[Kreft & Navarro, 2010]. In this work, we consider an optimal LZ-End parsing that has the minimum number of phrases in such parsings. We show that a decision version of computing the optimal LZ-End parsing is NP-complete by showing a reduction from the vertex cover problem. Moreover, we give a MAX-SAT formulation for the optimal LZ-End parsing adapting an approach for computing various NP-hard repetitiveness measures recently presented by [Bannai et al., 2022]. We also consider the approximation ratio of the size of greedy LZ-End parsing to the size of the optimal LZ-End parsing, and give a lower bound of the ratio which asymptotically approaches $2$.
翻译:LZ-End 是众所周知的 Lempel-Ziv 剖析家族的变体, 使每个评析的词句都以前发生过, 额外的制约是, 之前的评析必须结束。 LZ- End 最初被提议为贪婪的分解, 每个词句都是从左到右被贪婪地确定为满足上述制约[ Kreft & Navarro, 2010] 的最长因素。 在这项工作中, 我们考虑的是最佳的LZ- End 评析, 其数量在此类评析中最小。 我们表明, 计算最佳的 LZ- End 评析的决定版本通过显示对顶端覆盖问题的减少来完成。 此外, 我们给出了最优的 LZ- End 公式, 以调整计算最近[ Bannai 等人, 提出的各种NP- 硬重复性措施的方法。 我们还考虑了贪婪 LZ- End 的近似比率, 将最佳的LZ- End 表示为最低比例的LZ- End 。