运行- 字符串编码字符串之间近似动态时间超速距离 (Approximating Dynamic Time Warping Distance Between Run-Length Encoded Strings) - 专知论文

会员服务 ·

0

近似 · 相似度度量 · Bioinformatics · Better · 相似度 ·

2022 年 7 月 2 日

Approximating Dynamic Time Warping Distance Between Run-Length Encoded Strings

翻译：运行- 字符串编码字符串之间近似动态时间超速距离

Zoe Xi,William Kuszmaul

from arxiv, A shorter version of this paper will be published in ESA 2022

Dynamic Time Warping (DTW) is a widely used similarity measure for comparing strings that encode time series data, with applications to areas including bioinformatics, signature verification, and speech recognition. The standard dynamic-programming algorithm for DTW takes $O(n^2)$ time, and there are conditional lower bounds showing that no algorithm can do substantially better. In many applications, however, the strings $x$ and $y$ may contain long runs of repeated letters, meaning that they can be compressed using run-length encoding. A natural question is whether the DTW-distance between these compressed strings can be computed efficiently in terms of the lengths $k$ and $\ell$ of the compressed strings. Recent work has shown how to achieve $O(k\ell^2 + \ell k^2)$ time, leaving open the question of whether a near-quadratic $\tilde{O}(k\ell)$-time algorithm might exist. We show that, if a small approximation loss is permitted, then a near-quadratic time algorithm is indeed possible: our algorithm computes a $(1 + \epsilon)$-approximation for $DTW(x, y)$ in $\tilde{O}(k\ell / \epsilon^3)$ time, where $k$ and $\ell$ are the number of runs in $x$ and $y$. Our algorithm allows for $DTW$ to be computed over any metric space $(\Sigma, \delta)$ in which distances are $O(log(n))$-bit integers. Surprisingly, the algorithm also works even if $\delta$ does not induce a metric space on $\Sigma$ (e.g., $\delta$ need not satisfy the triangle inequality).

翻译：动态时间扭曲( DTW) 是用来比较时间序列数据( 包括生物信息、签名验证和语音识别) 的字符串的广泛使用相似度量。 DTW 的标准动态程序程序算法需要O (n2) 美元的时间, 并且有有条件的下限, 这表明没有算法可以做的更好。然而, 在许多应用程序中, 字符x$ 和 $ 可能包含长长的重复字母, 这意味着它们可以使用运行长的编码压缩。一个自然的问题是, 这些压缩的字符串之间的 DTW 距离能否以长度( 美元) 和压缩的字符串的美元来有效计算。最近的工作表明, 如何实现 $( kell2 + k2) 和美元有条件的下限。在许多应用程序中, 字符x $ (k) 美元和美元美元的时间算法可能存在。我们显示, 如果允许任何小的近似损失, 那么这些压缩字符串之间的时间算算法是美元。

0

相关内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

AlGaN极化场调控对内量子效率的影响

国家自然科学基金

1+阅读 · 2016年12月31日

几类平面微分系统的极限环分支

国家自然科学基金

0+阅读 · 2015年12月31日

有理 Krylov 子空间算法的最优参数选取

国家自然科学基金

0+阅读 · 2015年12月31日

大口径平面镜子孔径拼接检测中表面中高频误差的检测误差处理方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

番茄果实成熟相关Dicer-like 2c的调控机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于生物医学文献的隐含知识发现方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

多复变函数空间上的算子和全纯映照类的分析与几何性质

国家自然科学基金

0+阅读 · 2012年12月31日

Hamilton系统的辛几何算法和对称算法的定性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Levy扩散过程与非局部偏微分方程

国家自然科学基金

1+阅读 · 2012年12月31日

c-Src激酶在2型糖尿病脑动脉BKCa通道功能障碍中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

Efficient Backward Reachability Using the Minkowski Difference of Constrained Zonotopes

Arxiv

0+阅读 · 2022年8月25日

An Improved Bernstein-type Inequality for C-Mixing-type Processes and Its Application to Kernel Smoothing

Arxiv

0+阅读 · 2022年8月24日

Scalable Linear Time Dense Direct Solver for 3-D Problems Without Trailing Sub-Matrix Dependencies

Arxiv

0+阅读 · 2022年8月23日

The Computational Complexity of ReLU Network Training Parameterized by Data Dimensionality

Arxiv

0+阅读 · 2022年8月23日

Efficient numerical method for reliable upper and lower bounds on homogenized parameters

Arxiv

0+阅读 · 2022年8月21日

An efficient acceleration technique for methods for finding the nearest point in a polytope and computing the distance between two polytopes

Arxiv

0+阅读 · 2022年8月20日

Approximating Symmetrized Estimators of Scatter via Balanced Incomplete U-Statistics

Arxiv

0+阅读 · 2022年8月19日

Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Arxiv

0+阅读 · 2022年8月19日

Merging Sorted Lists of Similar Strings

Arxiv

0+阅读 · 2022年8月19日

A Vector Fitting approach for the automated estimation of lumped boundary conditions of 1D circulation models

Arxiv

0+阅读 · 2022年8月14日

VIP会员

文章信息

相关主题

相似度度量

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津博士论文】零样本强化学习综述

《美军条令：陆军指挥官与规划人员地理空间指南》60页

战术边缘指挥控制：防务面临的核心挑战

迈向开放世界检测：综述

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

相关论文

Efficient Backward Reachability Using the Minkowski Difference of Constrained Zonotopes

Arxiv

0+阅读 · 2022年8月25日

An Improved Bernstein-type Inequality for C-Mixing-type Processes and Its Application to Kernel Smoothing

Arxiv

0+阅读 · 2022年8月24日

Scalable Linear Time Dense Direct Solver for 3-D Problems Without Trailing Sub-Matrix Dependencies

Arxiv

0+阅读 · 2022年8月23日

The Computational Complexity of ReLU Network Training Parameterized by Data Dimensionality

Arxiv

0+阅读 · 2022年8月23日

Efficient numerical method for reliable upper and lower bounds on homogenized parameters

Arxiv

0+阅读 · 2022年8月21日

An efficient acceleration technique for methods for finding the nearest point in a polytope and computing the distance between two polytopes

Arxiv

0+阅读 · 2022年8月20日

Approximating Symmetrized Estimators of Scatter via Balanced Incomplete U-Statistics

Arxiv

0+阅读 · 2022年8月19日

Suboptimal Performance of the Bayes Optimal Algorithm in Frequentist Best Arm Identification

Arxiv

0+阅读 · 2022年8月19日

Merging Sorted Lists of Similar Strings

Arxiv

0+阅读 · 2022年8月19日

A Vector Fitting approach for the automated estimation of lumped boundary conditions of 1D circulation models

Arxiv

0+阅读 · 2022年8月14日

相关基金

AlGaN极化场调控对内量子效率的影响

国家自然科学基金

1+阅读 · 2016年12月31日

几类平面微分系统的极限环分支

国家自然科学基金

0+阅读 · 2015年12月31日

有理 Krylov 子空间算法的最优参数选取

国家自然科学基金

0+阅读 · 2015年12月31日

大口径平面镜子孔径拼接检测中表面中高频误差的检测误差处理方法研究

国家自然科学基金

0+阅读 · 2015年12月31日

番茄果实成熟相关Dicer-like 2c的调控机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

基于生物医学文献的隐含知识发现方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

多复变函数空间上的算子和全纯映照类的分析与几何性质

国家自然科学基金

0+阅读 · 2012年12月31日

Hamilton系统的辛几何算法和对称算法的定性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Levy扩散过程与非局部偏微分方程

国家自然科学基金

1+阅读 · 2012年12月31日

c-Src激酶在2型糖尿病脑动脉BKCa通道功能障碍中的作用

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员