Optimal Codes Detecting Deletions in Concatenated Binary Strings Applied to Trace Reconstruction

April 19, 2023

Serge Kas Hanna

From arXiv. Accepted for publication in the IEEE Transactions on Information Theory. arXiv admin note: substantial text overlap with arXiv:2207.05126, arXiv:2105.00212

Consider two or more strings $\mathbf{x}^1,\mathbf{x}^2,\ldots,$ that are concatenated to form $\mathbf{x}=\langle \mathbf{x}^1,\mathbf{x}^2,\ldots \rangle$. Suppose that up to $\delta$ deletions occur in each of the concatenated strings. Since deletions alter the lengths of the strings, a fundamental question to ask is: how much redundancy do we need to introduce in $\mathbf{x}$ in order to recover the boundaries of $\mathbf{x}^1,\mathbf{x}^2,\ldots$? This boundary problem is equivalent to the problem of designing codes that can detect the exact number of deletions in each concatenated string. In this work, we answer the question above by first deriving converse results that give lower bounds on the redundancy of deletion-detecting codes. Then, we present a marker-based code construction whose redundancy is asymptotically optimal in $\delta$ among all families of deletion-detecting codes, and exactly optimal among all block-by-block decodable codes. To exemplify the usefulness of such deletion-detecting codes, we apply our code to trace reconstruction and design an efficient coded reconstruction scheme that requires a constant number of traces.
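The boundary problem described in the abstract can be illustrated with a small sketch. The abstract does not spell out the marker pattern, so the one used here is an assumption chosen for illustration: each block of length $\ell$ starts with $\delta+1$ ones and ends with $\delta$ zeros. The function names (`encode_block`, `delete_per_block`, `detect_deletions`) are hypothetical. The decoder scans block by block: it inspects the last $\delta$ positions of the expected block window, and the position of the first 1 there (which must belong to the next block) reveals how many bits the current block lost.

```python
import random


def encode_block(data: str, delta: int) -> str:
    """Wrap data bits with an assumed marker: delta+1 ones in front, delta zeros behind."""
    return "1" * (delta + 1) + data + "0" * delta


def delete_per_block(blocks, delta, rng):
    """Delete between 0 and delta bits from each block, then concatenate.

    Returns the received string and the true deletion count per block.
    """
    received, counts = [], []
    for block in blocks:
        d = rng.randint(0, delta)
        dropped = set(rng.sample(range(len(block)), d))
        received.append("".join(c for i, c in enumerate(block) if i not in dropped))
        counts.append(d)
    return "".join(received), counts


def detect_deletions(y: str, num_blocks: int, block_len: int, delta: int):
    """Recover the number of deletions in each block of the received string y."""
    counts = []
    pos = 0  # start of the current block in y
    for _ in range(num_blocks - 1):
        # Last delta positions of the window the block would occupy if intact.
        window = y[pos + block_len - delta : pos + block_len]
        first_one = window.find("1")
        # Surviving trailing zeros fill the window up to the block boundary;
        # the first 1 is the leading marker bit of the next block.
        d = 0 if first_one == -1 else delta - first_one
        counts.append(d)
        pos += block_len - d
    # The last block has no successor; its deletions follow from the leftover length.
    counts.append(block_len - (len(y) - pos))
    return counts


if __name__ == "__main__":
    # Two blocks with delta = 1; one bit is deleted from the first block.
    y = "110010" + encode_block("0011", 1)
    print(detect_deletions(y, num_blocks=2, block_len=7, delta=1))  # [1, 0]
```

Under this assumed pattern each block carries $2\delta+1$ marker bits. Whether that matches the redundancy of the construction in the paper is not stated in the abstract, so the sketch should be read only as one way such block-by-block detection can work.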
