In this paper we study the number $r_{bwt}$ of equal-letter runs produced by the Burrows-Wheeler transform ($BWT$) when it is applied to purely morphic finite words, which are words generated by iterating prolongable morphisms. Such a parameter $r_{bwt}$ is very significant since it provides a measure of the performances of the $BWT$, in terms of both compressibility and indexing. In particular, we prove that, when $BWT$ is applied to any purely morphic finite word on a binary alphabet, $r_{bwt}$ is $\mathcal{O}(\log n)$, where $n$ is the length of the word. Moreover, we prove that $r_{bwt}$ is $\Theta(\log n)$ for the binary words generated by a large class of prolongable binary morphisms. These bounds are proved by providing some new structural properties of the \emph{bispecial circular factors} of such words.
翻译:在本文中,我们研究了由Burrows-Wheeler变换(BWT$)产生的等量字母的美元数,当它被应用到纯定式的限定词时,这些词是可变长的变换形态产生的单词。这样的参数$r ⁇ bwt$是十分重要的,因为它提供了对美元(BWT)的可压缩性和索引性两方面表现的衡量。特别是,我们证明,当美元(BWT)用于二元字母上任何纯定式的单词时,${bwt}$是 $\ mathcal{O}(\log n)$, 美元是该单词的长度。此外,我们证明,对于一大批可变长的双曲性词产生的双元(n),$($)是$\theta(log n) 。通过提供这类单词的\emph{crecial因的一些新的结构属性来证明这些界限。