Error-correcting codes over sets, with applications to DNA storage, are studied. The DNA-storage channel receives a set of sequences and outputs a corrupted version of the set; the corruption may include sequence loss, symbol substitutions, symbol insertions/deletions, and limited-magnitude errors in symbols. Various parameter regimes are studied. New bounds on code parameters are provided, which improve upon known bounds. New codes are constructed, at times matching the bounds up to lower-order terms or small constant factors.
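To make the channel model concrete, the following is a minimal toy sketch of a set-valued channel, not the formal model of the paper. It covers only two of the listed error types (sequence loss and symbol substitution); insertions/deletions and limited-magnitude errors are omitted. The function name `dna_storage_channel`, the parameters `max_lost` and `max_subs`, and the alphabet `ACGT` are assumptions introduced for illustration.

```python
import random


def dna_storage_channel(codeword, max_lost, max_subs, alphabet="ACGT", seed=None):
    """Toy set channel (illustrative assumption, not the paper's definition).

    Drops at most `max_lost` sequences from the input set, then substitutes
    at most `max_subs` symbols in total across the surviving sequences.
    """
    rng = random.Random(seed)
    seqs = sorted(codeword)  # fixed order so a given seed is reproducible

    # Sequence loss: keep a random subset of the stored sequences.
    n_lost = rng.randint(0, min(max_lost, len(seqs)))
    survivors = [list(s) for s in rng.sample(seqs, len(seqs) - n_lost)]

    # Symbol substitution: pick distinct (sequence, position) pairs and
    # replace each chosen symbol with a different symbol of the alphabet.
    positions = [(i, j) for i, s in enumerate(survivors) for j in range(len(s))]
    n_subs = rng.randint(0, min(max_subs, len(positions)))
    for i, j in rng.sample(positions, n_subs):
        choices = [a for a in alphabet if a != survivors[i][j]]
        survivors[i][j] = rng.choice(choices)

    # The output is again a set: order is lost, and sequences that become
    # identical after substitution merge into a single element.
    return {"".join(s) for s in survivors}


if __name__ == "__main__":
    stored = {"ACGT", "TTGA", "CCAG", "GATC"}
    print(dna_storage_channel(stored, max_lost=1, max_subs=2, seed=0))
```

Returning a Python `set` reflects the unordered nature of the stored sequences: the receiver learns neither the original order nor which sequences were lost.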