线性插入删除码在高噪声和高速率情况下的应用 (Linear Insertion Deletion Codes in the High-Noise and High-Rate Regimes)

This work continues the study of linear error correcting codes against adversarial insertion deletion errors (insdel errors). Previously, the work of Cheng, Guruswami, Haeupler, and Li \cite{CGHL21} showed the existence of asymptotically good linear insdel codes that can correct arbitrarily close to $1$ fraction of errors over some constant size alphabet, or achieve rate arbitrarily close to $1/2$ even over the binary alphabet. As shown in \cite{CGHL21}, these bounds are also the best possible. However, known explicit constructions in \cite{CGHL21}, and subsequent improved constructions by Con, Shpilka, and Tamo \cite{9770830} all fall short of meeting these bounds. Over any constant size alphabet, they can only achieve rate $< 1/8$ or correct $< 1/4$ fraction of errors; over the binary alphabet, they can only achieve rate $< 1/1216$ or correct $< 1/54$ fraction of errors. Apparently, previous techniques face inherent barriers to achieve rate better than $1/4$ or correct more than $1/2$ fraction of errors. In this work we give new constructions of such codes that meet these bounds, namely, asymptotically good linear insdel codes that can correct arbitrarily close to $1$ fraction of errors over some constant size alphabet, and binary asymptotically good linear insdel codes that can achieve rate arbitrarily close to $1/2$.\ All our constructions are efficiently encodable and decodable. Our constructions are based on a novel approach of code concatenation, which embeds the index information implicitly into codewords. This significantly differs from previous techniques and may be of independent interest. Finally, we also prove the existence of linear concatenated insdel codes with parameters that match random linear codes, and propose a conjecture about linear insdel codes.

翻译：本文继续研究线性纠错码对抗插入删除错误（insdel错误）的应用。先前的研究表明，在某些恒定大小的字母表上，或即使在二进制字母表上，有渐近好的线性insdel码可以纠正接近于1的错误分数或实现一半速率，这些边界也是最优的。然而，CGHL21中的已知显式构造，以及之后由Con、Shpilka和Tamo在文献编号为9770830中改进的构造，都未能达到此要求。在任何恒定大小的字母表下，他们只能实现速率< 1/8或纠正< 1/4的错误分数；在二进制字母表下，他们只能实现速率 <1/1216或纠正<1/54的错误分数。先前的技术似乎面临着实现速率超过1/4或纠正超过1/2错误分数的固有障碍。在本文中，我们提供了能够达到这些要求的新构造，即能够纠正恒定大小字母表上接近于1的错误分数的渐近好的线性insdel码，以及能够实现速率接近于1/2的二进制渐近好的线性insdel码。所有我们的构造都可以有效地编码和解码。我们的构造是基于编码串联的一种新方法，将索引信息隐式嵌入码字中。这与以前的技术显著不同，可能独立地具有利益。最后，我们还证明了存在与随机线性码匹配的线性串联insdel码的参数，并提出了有关线性insdel码的猜想。