We study permutations over the set of $\ell$-grams, that are feasible in the sense that there is a sequence whose $\ell$-gram frequency has the same ranking as the permutation. Codes, which are sets of feasible permutations, protect information stored in DNA molecules using the rank-modulation scheme, and read using the shotgun sequencing technique. We construct systematic codes with an efficient encoding algorithm, and show that they are optimal in size. The length of the DNA sequences that correspond to the codewords is shown to be polynomial in the code parameters. Non-systematic with larger size are also constructed.
翻译:我们研究一套$/ ell$- gram($/ ell$- gram) 的变异,这是可行的,因为有一个序列的 $/ ell$- gram($/ ell$- gram) 频率与变异的等级相同。代码是一套可行的变异组合,用来保护使用等级调制方案储存在DNA分子中的信息,并使用猎枪排序技术来阅读。我们用高效的编码算法来构建系统代码,并显示这些代码的尺寸是最佳的。在代码参数中,匹配的DNA序列的长度显示是多数值的。还建造了规模较大的非系统代码。