Computing MEMs on Repetitive Text Collections - 专知 Papers



November 10, 2022

Computing MEMs on Repetitive Text Collections


Gonzalo Navarro

We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1..m]$ on a large repetitive text collection $T[1..n]$, which is represented as a (hopefully much smaller) run-length context-free grammar of size $g_{rl}$. We show that the problem can be solved in time $O(m^2 \log^\epsilon n)$, for any constant $\epsilon > 0$, on a data structure of size $O(g_{rl})$. Further, on a locally consistent grammar of size $O(\delta\log\frac{n}{\delta})$, the time decreases to $O(m\log m(\log m + \log^\epsilon n))$. The value $\delta$ is a function of the substring complexity of $T$ and $\Omega(\delta\log\frac{n}{\delta})$ is a tight lower bound on the compressibility of repetitive texts $T$, so our structure has optimal size in terms of $n$ and $\delta$. We extend our results to the problem of finding $q$-MEMs, which must appear at least $q$ times in $T$.
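To make the definitions concrete, here is a minimal brute-force sketch of what a MEM (and a $q$-MEM) is. It is not the grammar-based method of the abstract: it scans the plain, uncompressed text, and its running time is far worse than the stated bounds. The function names `occurs_at_least` and `naive_mems`, the 0-based half-open `(start, end)` output format, and the example strings are assumptions chosen for illustration.

```python
def occurs_at_least(text: str, s: str, q: int) -> bool:
    """Return True if s occurs at least q times in text (overlaps allowed, q >= 1)."""
    count = 0
    start = text.find(s)
    while start != -1:
        count += 1
        if count >= q:
            return True
        # Advance by one position so overlapping occurrences are counted.
        start = text.find(s, start + 1)
    return False


def naive_mems(pattern: str, text: str, q: int = 1) -> list[tuple[int, int]]:
    """Report q-MEMs of pattern in text as half-open intervals (start, end).

    pattern[start:end] occurs at least q times in text, and neither its
    left extension nor its right extension does. With q = 1 these are
    the ordinary MEMs.
    """
    mems = []
    prev_end = 0  # furthest match end reached from any earlier start
    for i in range(len(pattern)):
        # Greedily extend the match starting at position i.
        end = i
        while end < len(pattern) and occurs_at_least(text, pattern[i:end + 1], q):
            end += 1
        # Non-empty match that reaches further right than every earlier
        # start: it cannot be extended to the left, so it is maximal.
        if end > i and end > prev_end:
            mems.append((i, end))
        prev_end = max(prev_end, end)
    return mems


if __name__ == "__main__":
    T = "abracadabra"
    P = "cadabrac"
    print(naive_mems(P, T))       # [(0, 7), (3, 8)] -> "cadabra", "abrac"
    print(naive_mems(P, T, q=2))  # [(1, 2), (3, 7)] -> "a", "abra"
```

The left-maximality test relies on the fact that match ends are non-decreasing as the start position advances, so a match is maximal exactly when it ends strictly further right than every match from an earlier start.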

