Decoding sequences that stem from multiple transmissions of a codeword over an insertion, deletion, and substitution channel is a critical component of efficient deoxyribonucleic acid (DNA) data storage systems. In this paper, we consider a concatenated coding scheme with an outer nonbinary low-density parity-check code or a polar code and either an inner convolutional code or a time-varying block code. %In our scheme, we use an optimized inner convolutional code for the single sequence IDS channel based on its standalone maximum-likelihood performance. We propose two novel decoding algorithms for inference from multiple received sequences, both combining the inner code and channel to a joint hidden Markov model to infer symbolwise a posteriori probabilities (APPs). The first decoder computes the exact APPs by jointly decoding the received sequences, whereas the second decoder approximates the APPs by combining the results of separately decoded received sequences and has a complexity that is linear with the number of sequences. Using the proposed algorithms, we evaluate the performance of decoding multiple received sequences by means of achievable information rates and Monte-Carlo simulations. We show significant performance gains compared to a single received sequence. In addition, we succeed in improving the performance of the aforementioned coding scheme by optimizing both the inner and outer codes.
翻译:插入、删除和替代通道的多发编码传输产生的解码序列,是高效脱氧核糖核酸(DNA)数据存储系统的关键组成部分。在本文中,我们考虑采用外非二元低密度对等检查代码或极代代码的混合编码办法,或者采用内共代码,或者采用时间变化区块代码。%在我们的方法中,我们根据独立的最大相似性性性能,对IDS频道的单一序列使用最优化的内部共变码编码。我们建议采用两种新型解码算法,以便从收到的多个序列中推断出错,这两种系统将内部代码合并起来,并连接到共同隐藏的马尔科夫模型,以便将外差概率符号与后继性概率(APPs)相混合。第一个解码代码通过联合解码接收的序列来计算准确的应用程序,而第二个解码代码则通过将单独解码的序列的结果与所接收到的相近相近,其复杂性与所接收的序列数相直线。我们用拟议的业绩排序方法来显示一次业绩升级的进度,我们通过改进的进度图的进度图的进度,我们通过模拟来评估了一次业绩的进度的进度。我们评估了一次业绩的进度的进度的进度图。我们评估了一次的进度的进度图。