Due to the redundant nature of DNA synthesis and sequencing technologies, a basic model for a DNA storage system is a multi-draw "shuffling-sampling" channel. In this model, a random number of noisy copies of each sequence is observed at the channel output. Recent works have characterized the capacity of such a DNA storage channel under different noise and sequencing models, relying on sophisticated typicality-based approaches for the achievability. Here, we consider a multi-draw DNA storage channel in the setting of noise corruption by a binary erasure channel. We show that, in this setting, the capacity is achieved by linear coding schemes. This leads to a considerably simpler derivation of the capacity expression of a multi-draw DNA storage channel than existing results in the literature.
翻译:由于DNA合成和测序技术的冗余性质,DNA储存系统的基本模式是多拖式“抽查”通道。在这一模型中,在频道输出中观察到每个序列的随机噪音复制件。最近的工作特征是这种DNA储存渠道在不同的噪音和测序模型下的能力,依靠复杂的基于典型特性的可实现性方法。在这里,我们考虑在通过二进制消除频道造成噪音腐败时采用多拖式DNA储存渠道。我们表明,在这种环境下,能力是通过线性编码办法实现的。这导致比文献中的现有结果更简单得多地衍生出多拖式DNA储存渠道的能力表现。