Decoding sequences that stem from multiple transmissions of a codeword over an insertion, deletion, and substitution channel is a critical component of efficient deoxyribonucleic acid (DNA) data storage systems. In this paper, we consider a concatenated coding scheme with an outer nonbinary low-density parity-check code or a polar code and either an inner convolutional code or a time-varying block code. We propose two novel decoding algorithms for inference from multiple received sequences, both combining the inner code and channel to a joint hidden Markov model to infer symbolwise a posteriori probabilities (APPs). The first decoder computes the exact APPs by jointly decoding the received sequences, whereas the second decoder approximates the APPs by combining the results of separately decoded received sequences and has a complexity that is linear with the number of sequences. Using the proposed algorithms, we evaluate the performance of decoding multiple received sequences by means of achievable information rates and Monte-Carlo simulations. We show significant performance gains compared to a single received sequence. In addition, we succeed in improving the performance of the aforementioned coding scheme by optimizing both the inner and outer codes.
翻译:插入、 删除和替换通道的编码多次传输产生的解码序列,是高效脱氧核糖核酸(DNA)数据储存系统的关键组成部分。 在本文中,我们考虑的是外非二元低密度对等检查代码或极代代码的混合编码办法,以及内部共变代码或时间变化区块代码。我们提议了两种新型解码算法,用以推断从多个收到序列中得出的推断,两者结合了内部编码和连接到一个联合隐藏的Markov模型的渠道,以推断后继概率(APPs)的符号。第一个解码器通过联合解码收到的序列,对精确的APps进行了计算,而第二个解码器则通过将单独解码收到序列的结果合并来接近APs,并且具有线性的复杂性。我们利用拟议的算法,通过可实现的信息率和蒙特-卡尔洛模拟来评估解码多个收到序列的性能。我们通过改进内部代码的成绩,我们通过改进内部性能,通过改进一个单一序列来改进内部性能。