Computing MEMs on Repetitive Text Collections


October 20, 2022


Gonzalo Navarro

We consider the problem of computing the Maximal Exact Matches (MEMs) of a given pattern $P[1..m]$ on a large repetitive text collection $T[1..n]$, which is represented as a (hopefully much smaller) run-length context-free grammar of size $g_{rl}$. We show that the problem can be solved in time $O(m^2 \log^\epsilon n)$, for any constant $\epsilon > 0$, on a data structure of size $O(g_{rl})$. Further, on a locally consistent grammar of size $O(\delta\log\frac{n}{\delta})$, the time decreases to $O(m\log m(\log m + \log^\epsilon n))$. The value $\delta$ is a function of the substring complexity of $T$ and $\Omega(\delta\log\frac{n}{\delta})$ is a tight lower bound on the compressibility of repetitive texts $T$, so our structure has optimal size in terms of $n$ and $\delta$.
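For readers unfamiliar with the term, the sketch below illustrates what a MEM is, using a brute-force baseline on plain strings. It is not the grammar-based method of the paper and achieves none of the stated time or space bounds. The function name `maximal_exact_matches` is hypothetical, and positions are 0-based here, whereas the abstract uses 1-based indexing. A MEM is taken to be a substring of $P$ that occurs in $T$ and cannot be extended to the left or right within $P$ while still occurring in $T$.

```python
from __future__ import annotations


def maximal_exact_matches(pattern: str, text: str) -> list[tuple[int, int]]:
    """Return the MEMs of `pattern` in `text` as (start, length) pairs (0-based).

    Brute-force illustration only: each substring test scans `text` directly.
    """
    mems: list[tuple[int, int]] = []
    m = len(pattern)
    # Exclusive end of the longest match found for the previous start position.
    prev_end = 0
    for start in range(m):
        # If pattern[start-1:prev_end] occurs in text, so does
        # pattern[start:prev_end], so the scan can resume from prev_end.
        end = max(prev_end, start)
        while end < m and pattern[start:end + 1] in text:
            end += 1
        # Non-empty, and strictly longer on the right than the previous match:
        # this means it cannot be extended to the left either.
        if end > start and end > prev_end:
            mems.append((start, end - start))
        prev_end = end
    return mems


if __name__ == "__main__":
    print(maximal_exact_matches("abcxab", "zabcyab"))
```

With pattern `abcxab` and text `zabcyab`, the two MEMs are `abc` (start 0, length 3) and `ab` (start 4, length 2). The inner matches `bc`, `c`, and `b` are not reported because each can be extended to the left.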

