The errors occurring in DNA-based storage are correlated in nature, which is a direct consequence of the synthesis and sequencing processes. In this paper, we consider the memory-$k$ nanopore channel model recently introduced by Hamoum et al., which models the inherent memory of the channel. We derive the maximum a posteriori (MAP) decoder for this channel model. The derived MAP decoder allows us to compute achievable information rates for the true DNA storage channel assuming a mismatched decoder matched to the memory-$k$ nanopore channel model, and quantify the loss in performance assuming a small memory length--and hence limited decoding complexity. Furthermore, the derived MAP decoder can be used to design error-correcting codes tailored to the DNA storage channel. We show that a concatenated coding scheme with an outer low-density parity-check code and an inner convolutional code yields excellent performance.
翻译:染色体基于DNA的存储的错误具有相关性,这是合成和测序过程的直接结果。本文考虑了Hamoum等人最近引入的存储器-k核孔道模型,该模型模拟了通道的固有记忆。我们推导了该通道模型的最大后验概率(MAP)译码器,并针对真实的DNA存储通道使用不匹配的解码器匹配于存储器-k nanopore通道模型,计算可实现的信息速率,并量化在具有小内存长度(因此受限的解码复杂度)的情况下的性能损失。此外,推导的MAP译码器可以用于设计适用于DNA存储通道的纠错码。我们表明,一个具有外部低密度奇偶校验码和内部卷积码的串联编码方案具有出色的性能。