衡量重复性L系统* (L-systems for Measuring Repetitiveness*)

An L-system (for lossless compression) is a CPD0L-system extended with two parameters $d$ and $n$, which determines unambiguously a string $w = \tau(\varphi^d(s))[1:n]$, where $\varphi$ is the morphism of the system, $s$ is its axiom, and $\tau$ is its coding. The length of the shortest description of an L-system generating $w$ is known as $\ell$, and is arguably a relevant measure of repetitiveness that builds on the self-similarities that arise in the sequence. In this paper we deepen the study of the measure $\ell$ and its relation with $\delta$, a better established lower bound that builds on substring complexity. Our results show that $\ell$ and $\delta$ are largely orthogonal, in the sense that one can be much larger than the other depending on the case. This suggests that both sources of repetitiveness are mostly unrelated. We also show that the recently introduced NU-systems, which combine the capabilities of L-systems with bidirectional macro-schemes, can be asymptotically strictly smaller than both mechanisms, which makes the size $\nu$ of the smallest NU-system the unique smallest reachable repetitiveness measure to date.

翻译：L系统(用于无损压缩)是一个扩展的CPD0L系统(用于无损压缩),有2个参数(美元)和1美元(美元),它明确确定一个字符串 $ =\ tau( varphip ⁇ d) [1:n]$, 美元是该系统的形态, 美元是其正态, 美元是其正态, 美元是其编码是美元。生成美元( w$)的L系统最短描述的长度被称为 $ / ell, 可以说是一种相关的重复性衡量标准, 以序列中出现的自我差异为基础。在本文中, 我们深化了对 $( varphipi) 及其与 $( delta$) 的关系的研究, 美元是建立在亚化复杂性基础上的更固定的更低约束。我们的结果显示, $( ell$) 和 $( dedelta) $( $) 是一个大为正态的,, 意即一个系统可以比另一个大得多, 视具体情况而定, 重复性来源为不相近似。我们还显示, 最近引入的NU系统系统系统可以将最小的周期与最小的周期与最小的货币机制结合起来合并起来。