非对称数字系统的最佳压缩 (Compression Optimality of Asymmetric Numeral Systems)

Compression also known as entropy coding has a rich and long history. However, a recent explosion of multimedia Internet applications (such as teleconferencing and video streaming for instance) renews an interest in fast compression that also squeezes out as much redundancy as possible. In 2009 Jarek Duda invented his asymmetric numeral system (ANS). Apart from a beautiful mathematical structure, it is very efficient and offers compression with a very low residual redundancy. ANS works well for any symbol source statistics. Besides, ANS has become a preferred compression algorithm in the IT industry. However, designing ANS instance requires a random selection of its symbol spread function. Consequently, each ANS instance offers compression with a slightly different compression rate. The paper investigates compression optimality of ANS. It shows that ANS is optimal (i.e. the entropies of encoding and source are equal) for any symbol sources whose probability distribution is described by natural powers of 1/2. We use Markov chains to calculate ANS state probabilities. This allows us to determine ANS compression rate precisely. We present two algorithms for finding ANS instances with high compression rates. The first explores state probability approximations in order to choose ANS instances with better compression rates. The second algorithm is a probabilistic one. It finds ANS instances, whose compression rate can be made as close to the best rate as required. This is done at the expense of the number $\theta$ of internal random ``coin'' tosses. The algorithm complexity is ${\cal O}(\theta L^3)$, where $L$ is the number of ANS states. The complexity can be reduced to ${\cal O}(\theta L\log{L})$ if we use a fast matrix inversion. If the algorithm is implemented on quantum computer, its complexity becomes ${\cal O}(\theta (\log{L})^3)$.

翻译：compression 也被称为 entropy coding 。但是, 最近多媒体互联网应用程序( 如远程会议和视频流流等) 的爆炸使得人们重新对快速压缩感兴趣, 并尽可能压缩冗余。 2009 年, Jarek Duda 发明了他的不对称数字系统( ANS ) 。除了一个美丽的数学结构外, 它非常高效, 并且提供了非常低的剩余冗余。 ANS 对任何符号源统计数据来说, 都非常有效。此外, ANS 已经成了信息技术行业中最喜欢的压缩算法。然而, 设计 ANS 时需要随机选择其符号扩散功能。因此, 每个 ANS 时会提供略微不同的压缩速度的压缩。文件调查 ANS 压缩优化优化优化性。它的缩略图( 即编码和来源的精度是等值) 任何符号源, 其概率分布由自然能力描述为 1/2 。我们使用 Markov 链来计算 ANS 状态的概率。这可以让我们精确地确定 ANS 压缩率。我们用两个 NS 。我们用 at rial asal at remaild 来查找。