Levenshtein 距离的接近点,用于DNA储存 (Deep Squared Euclidean Approximation to the Levenshtein Distance for DNA Storage) - 专知论文

会员服务 ·

0

方阵 · Storage · 簇 · 近似 · CC ·

2022 年 7 月 11 日

Deep Squared Euclidean Approximation to the Levenshtein Distance for DNA Storage

翻译：Levenshtein 距离的接近点,用于DNA储存

Alan J. X. Guo,Cong Liang,Qing-Hu Hou

Storing information in DNA molecules is of great interest because of its advantages in longevity, high storage density, and low maintenance cost. A key step in the DNA storage pipeline is to efficiently cluster the retrieved DNA sequences according to their similarities. Levenshtein distance is the most suitable metric on the similarity between two DNA sequences, but it is inferior in terms of computational complexity and less compatible with mature clustering algorithms. In this work, we propose a novel deep squared Euclidean embedding for DNA sequences using Siamese neural network, squared Euclidean embedding, and chi-squared regression. The Levenshtein distance is approximated by the squared Euclidean distance between the embedding vectors, which is fast calculated and clustering algorithm friendly. The proposed approach is analyzed theoretically and experimentally. The results show that the proposed embedding is efficient and robust.

翻译：DNA分子中的信息存储非常有意义,因为它在长寿、高存储密度和低维护成本方面具有优势。 DNA存储管道中的一个关键步骤是根据相似之处有效地组合所回收的DNA序列。 Levenshtein 距离是衡量两个DNA序列之间相似性的最合适尺度,但在计算复杂性方面却不如计算性,而且与成熟的组群算法不相容。在这项工作中,我们提出了一个新的深方位的Euclidean 嵌入,用于DNA序列,使用Siams神经网络、平方 Euclidean 嵌入和基方回归。 Levenshtein 距离以嵌入矢体矢体之间的正方位 Euclidean 距离为近似,这是快速计算和组合算法友好的。对拟议方法进行了理论和实验分析。结果显示,提议的嵌入是高效和稳健的。

0

相关内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

人基因组远程调控元件的疾病表型和功能预测

国家自然科学基金

0+阅读 · 2014年12月31日

热应激下虹鳟mRNA差异表达及分子调控机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

信号通路XBP1-p21在细胞周期调控中的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

猪GnIH与GnRH介导的细胞内信号通路交联节点调控促性腺激素转录的机制

国家自然科学基金

0+阅读 · 2013年12月31日

DNA损伤应激反应中变异剪接基因的鉴定及其功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

Support vector machines and Radon's theorem

Arxiv

0+阅读 · 2022年9月2日

An Interpretable and Efficient Infinite-Order Vector Autoregressive Model for High-Dimensional Time Series

An Interpretable and Efficient Infinite-Order Vector Autoregressive Model for High-Dimensional Time Series

Arxiv

0+阅读 · 2022年9月2日

A solvable walking model for a two-legged robot

Arxiv

0+阅读 · 2022年9月2日

Constructing Embedded Lattice-based Algorithms for Multivariate Function Approximation with a Composite Number of Points

Arxiv

0+阅读 · 2022年9月2日

A Modern Introduction to Online Learning

A Modern Introduction to Online Learning

Arxiv

21+阅读 · 2019年12月31日

VIP会员

文章信息

相关主题

相关VIP内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Support vector machines and Radon's theorem

Arxiv

0+阅读 · 2022年9月2日

An Interpretable and Efficient Infinite-Order Vector Autoregressive Model for High-Dimensional Time Series

An Interpretable and Efficient Infinite-Order Vector Autoregressive Model for High-Dimensional Time Series

Arxiv

0+阅读 · 2022年9月2日

A solvable walking model for a two-legged robot

Arxiv

0+阅读 · 2022年9月2日

Constructing Embedded Lattice-based Algorithms for Multivariate Function Approximation with a Composite Number of Points

Arxiv

0+阅读 · 2022年9月2日

A Modern Introduction to Online Learning

A Modern Introduction to Online Learning

Arxiv

21+阅读 · 2019年12月31日

相关基金

人基因组远程调控元件的疾病表型和功能预测

国家自然科学基金

0+阅读 · 2014年12月31日

热应激下虹鳟mRNA差异表达及分子调控机理研究

国家自然科学基金

0+阅读 · 2014年12月31日

信号通路XBP1-p21在细胞周期调控中的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

猪GnIH与GnRH介导的细胞内信号通路交联节点调控促性腺激素转录的机制

国家自然科学基金

0+阅读 · 2013年12月31日

DNA损伤应激反应中变异剪接基因的鉴定及其功能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员