在最佳时间和空间中建设 Sprassar Sfix 树和 LCE 索引 (Construction of Sparse Suffix Trees and LCE Indexes in Optimal Time and Space) - 专知论文

会员服务 ·

0

稀疏 · 优化器 · Extensibility · 情景 · Pair ·

2021 年 7 月 12 日

Construction of Sparse Suffix Trees and LCE Indexes in Optimal Time and Space

翻译：在最佳时间和空间中建设 Sprassar Sfix 树和 LCE 索引

Dmitry Kosolobov,Nikita Sivukhin

from arxiv, 27 pages, 3 figures

The notions of synchronizing and partitioning sets are recently introduced variants of locally consistent parsings with great potential in problem-solving. In this paper we propose a deterministic algorithm that constructs for a given readonly string of length $n$ over the alphabet $\{0,1,\ldots,n^{\mathcal{O}(1)}\}$ a variant of $\tau$-partitioning set with size $\mathcal{O}(b)$ and $\tau = \frac{n}{b}$ using $\mathcal{O}(b)$ space and $\mathcal{O}(\frac{1}{\epsilon}n)$ time provided $b \ge n^\epsilon$, for $\epsilon > 0$. As a corollary, for $b \ge n^\epsilon$ and constant $\epsilon > 0$, we obtain linear construction algorithms with $\mathcal{O}(b)$ space on top of the string for two major small-space indexes: a sparse suffix tree, which is a compacted trie built on $b$ chosen suffixes of the string, and a longest common extension (LCE) index, which occupies $\mathcal{O}(b)$ space and allows us to compute the longest common prefix for any pair of substrings in $\mathcal{O}(n/b)$ time. For both, the $\mathcal{O}(b)$ construction storage is asymptotically optimal since the tree itself takes $\mathcal{O}(b)$ space and any LCE index with $\mathcal{O}(n/b)$ query time must occupy at least $\mathcal{O}(b)$ space by a known trade-off (at least for $b \ge \Omega(n / \log n)$). In case of arbitrary $b \ge \Omega(\log^2 n)$, we present construction algorithms for the partitioning set, sparse suffix tree, and LCE index with $\mathcal{O}(n\log_b n)$ running time and $\mathcal{O}(b)$ space, thus also improving the state of the art.

翻译：同步和分区设置的概念是最近引入的本地一致解析的变量 (%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

0

相关内容

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

专知会员服务

173+阅读 · 2020年11月13日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

专知会员服务

149+阅读 · 2020年4月20日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

197+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

3+阅读 · 2019年4月12日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

A Characterization of Individualization-Refinement Trees

Arxiv

0+阅读 · 2021年9月15日

Reconstruction on Trees and Low-Degree Polynomials

Arxiv

0+阅读 · 2021年9月14日

Optimizing the ecological connectivity of landscapes with generalized flow models and preprocessing

Arxiv

0+阅读 · 2021年9月14日

$\varepsilon$-isometric dimension reduction for incompressible subsets of $\ell_p$

Arxiv

0+阅读 · 2021年9月14日

New Extremal Binary Self-Dual Codes of Length 72 from $M_6(\mathbb{F}_2)G$ - Group Matrix Rings by a Hybrid Search Technique Based on a Neighbourhood-Virus Optimisation Algorithm

Arxiv

0+阅读 · 2021年9月14日

Efficient Sampling of Dependency Structures

Arxiv

0+阅读 · 2021年9月14日

Unitarization Through Approximate Basis

Arxiv

0+阅读 · 2021年9月13日

On Some Problems of Confidence Region Construction

Arxiv

0+阅读 · 2021年9月11日

The Labeled Direct Product Optimally Solves String Problems on Graphs

Arxiv

0+阅读 · 2021年9月11日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

VIP会员

文章信息

相关主题

相关VIP内容

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

【最受欢迎的概率书】《概率论：理论与实例》，490页pdf

专知会员服务

173+阅读 · 2020年11月13日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

54+阅读 · 2020年9月7日

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

【硬核书】信息论，528页pdf，Information Theory and Coding by Example

专知会员服务

149+阅读 · 2020年4月20日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

197+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】基础模型训练中网络规模数据的负责任与高效使用

《俄乌战争背景下俄罗斯的战略性海军分析（2022-2025年）》最新100页报告

人工智能时代背景下的未来海战

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

已删除

将门创投

3+阅读 · 2019年4月12日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

相关论文

A Characterization of Individualization-Refinement Trees

Arxiv

0+阅读 · 2021年9月15日

Reconstruction on Trees and Low-Degree Polynomials

Arxiv

0+阅读 · 2021年9月14日

Optimizing the ecological connectivity of landscapes with generalized flow models and preprocessing

Arxiv

0+阅读 · 2021年9月14日

$\varepsilon$-isometric dimension reduction for incompressible subsets of $\ell_p$

Arxiv

0+阅读 · 2021年9月14日

New Extremal Binary Self-Dual Codes of Length 72 from $M_6(\mathbb{F}_2)G$ - Group Matrix Rings by a Hybrid Search Technique Based on a Neighbourhood-Virus Optimisation Algorithm

Arxiv

0+阅读 · 2021年9月14日

Efficient Sampling of Dependency Structures

Arxiv

0+阅读 · 2021年9月14日

Unitarization Through Approximate Basis

Arxiv

0+阅读 · 2021年9月13日

On Some Problems of Confidence Region Construction

Arxiv

0+阅读 · 2021年9月11日

The Labeled Direct Product Optimally Solves String Problems on Graphs

Arxiv

0+阅读 · 2021年9月11日

Testing Matrix Rank, Optimally

Arxiv

3+阅读 · 2018年10月18日

微信扫码咨询专知VIP会员