String Indexing with Compressed Patterns


April 23, 2021


Philip Bille, Inge Li Gørtz, Teresa Anna Steiner

From arXiv. Draft of the full version; an extended abstract appeared at STACS 2020.

Given a string $S$ of length $n$, the classic string indexing problem is to preprocess $S$ into a compact data structure that supports efficient subsequent pattern queries. In this paper we consider the basic variant where the pattern is given in compressed form and the goal is to achieve query time that is fast in terms of the compressed size of the pattern. This captures the common client-server scenario, where a client submits a query and communicates it in compressed form to a server. Instead of the server decompressing the query before processing it, we consider how to efficiently process the compressed query directly. Our main result is a novel linear space data structure that achieves near-optimal query time for patterns compressed with the classic Lempel-Ziv compression scheme. Along the way we develop several data structural techniques of independent interest, including a novel data structure that compactly encodes all LZ77 compressed suffixes of a string in linear space and a general decomposition of tries that reduces the search time from logarithmic in the size of the trie to logarithmic in the length of the pattern.
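To make the setting concrete, here is a minimal sketch of the naive baseline the abstract contrasts against: the server decompresses the LZ77-compressed pattern and only then searches the text. It is not the paper's data structure. The phrase format (a literal character or a `(source, length)` copy), the brute-force parser, and the names `lz77_compress`, `lz77_decompress`, and `baseline_query` are all assumptions made for illustration; LZ77 variants differ in these details.

```python
from typing import List, Tuple, Union

# Assumed phrase format: a literal character, or a (source, length) copy
# from the text decoded so far. Real LZ77 variants differ in such details.
Phrase = Union[str, Tuple[int, int]]


def lz77_compress(text: str) -> List[Phrase]:
    """Greedy LZ77-style parse by brute force (slow; for illustration only)."""
    phrases: List[Phrase] = []
    i, n = 0, len(text)
    while i < n:
        best_src, best_len = 0, 0
        for src in range(i):
            length = 0
            # The copy may run past position i (self-referential overlap).
            while i + length < n and text[src + length] == text[i + length]:
                length += 1
            if length > best_len:
                best_src, best_len = src, length
        if best_len == 0:
            phrases.append(text[i])  # no earlier occurrence: emit a literal
            i += 1
        else:
            phrases.append((best_src, best_len))
            i += best_len
    return phrases


def lz77_decompress(phrases: List[Phrase]) -> str:
    """Expand phrases left to right, copying one character at a time."""
    out: List[str] = []
    for phrase in phrases:
        if isinstance(phrase, str):
            out.append(phrase)
        else:
            src, length = phrase
            for k in range(length):
                out.append(out[src + k])  # valid even when the copy overlaps
    return "".join(out)


def baseline_query(text: str, compressed_pattern: List[Phrase]) -> List[int]:
    """Naive server: decompress the pattern, then scan the text for it."""
    pattern = lz77_decompress(compressed_pattern)
    return [i for i in range(len(text) - len(pattern) + 1)
            if text.startswith(pattern, i)]


if __name__ == "__main__":
    query = lz77_compress("abab")          # what the client would send
    print(query)
    print(baseline_query("abababab_abab", query))
```

The baseline's cost grows with the decompressed pattern length, whereas the abstract's goal is query time that depends on the number of phrases in the compressed pattern.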

