Proteus: 自设计区域过滤器 (Proteus: A Self-Designing Range Filter) - 专知论文

会员服务 ·

0

Performer · 值域 · state-of-the-art · 稳健性 · SURF ·

2022 年 6 月 30 日

Proteus: A Self-Designing Range Filter

翻译：Proteus: 自设计区域过滤器

Eric R. Knorr,Baptiste Lemaire,Andrew Lim,Siqiang Luo,Huanchen Zhang,Stratos Idreos,Michael Mitzenmacher

from arxiv, 14 pages, 9 figures, originally published in the Proceedings of the 2022 International Conference on Management of Data (SIGMOD'22), ISBN: 9781450392495

We introduce Proteus, a novel self-designing approximate range filter, which configures itself based on sampled data in order to optimize its false positive rate (FPR) for a given space requirement. Proteus unifies the probabilistic and deterministic design spaces of state-of-the-art range filters to achieve robust performance across a larger variety of use cases. At the core of Proteus lies our Contextual Prefix FPR (CPFPR) model - a formal framework for the FPR of prefix-based filters across their design spaces. We empirically demonstrate the accuracy of our model and Proteus' ability to optimize over both synthetic workloads and real-world datasets. We further evaluate Proteus in RocksDB and show that it is able to improve end-to-end performance by as much as 5.3x over more brittle state-of-the-art methods such as SuRF and Rosetta. Our experiments also indicate that the cost of modeling is not significant compared to the end-to-end performance gains and that Proteus is robust to workload shifts.

翻译：我们引入了普罗特斯(Proteus),这是一个全新的自我设计近似范围过滤器,它基于抽样数据进行自我配置,以优化对特定空间要求的假正率(FPR ) 。普罗特斯(Proteus)统一了最先进的范围过滤器的概率和确定性设计空间,以便在更广泛的各种使用案例中实现强效。普罗特斯(Proteus)的核心是我们的“背景前端过滤器”(CPFPR)模型——一个正式的框架,用于其设计空间的基于前端过滤器的FPR。我们从经验上展示了我们模型和普罗特斯(Proteus)在合成工作量和真实世界数据集方面优化的准确性。我们进一步评估了罗克斯DB的Proteus(Proteus),并表明它能够通过5.3x(5.3x)来改善端端到端端端端的性功能,比如SuRF和罗塞塔(Rosetta)。我们的实验还表明,建模成本与端端端端端到端业绩收益相比并不大,普罗特斯(Proteus)也能够工作量变化。

0

相关内容

Performer

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

专知会员服务

20+阅读 · 2020年11月13日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

65+阅读 · 2020年7月25日

【KDD2020】基于知识图谱的语义融合改进会话推荐系统，Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion

【KDD2020】基于知识图谱的语义融合改进会话推荐系统，Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion

专知会员服务

88+阅读 · 2020年7月9日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

105+阅读 · 2020年6月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

94+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

53+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

18+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

56+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

81+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

26+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

27+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新八篇推荐系统相关论文—可解释推荐、上下文感知推荐系统、异构知识库嵌入、深度强化学习、移动推荐系统

【论文推荐】最新八篇推荐系统相关论文—可解释推荐、上下文感知推荐系统、异构知识库嵌入、深度强化学习、移动推荐系统

专知

17+阅读 · 2018年6月16日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

大岩桐叶序调控相关基因的克隆及功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

抗性小麦应答孢囊线虫侵染的ROS相关基因的分离与功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

基于压电驱动的多自由度微操作器精密定位及振动抑振一体化控制理论与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

citron kinase促进HIV-1病毒颗粒包装出芽机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

免疫相关基因与慢性非细菌性前列腺炎的关联分析及关键靶点的生物功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

关于图上随机游走、渗流的几个问题

国家自然科学基金

0+阅读 · 2012年12月31日

玉米大斑病菌水甘油通道蛋白StFps1基因的克隆与功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

蜡样芽孢杆菌Bacillus cereus 905锰超氧化物歧化酶（MnSOD）基因表达调控途径的研究

国家自然科学基金

0+阅读 · 2011年12月31日

福氏志贺氏菌HtrA蛋白功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

西班牙流感病毒膜蛋白PB1-F2的结构及功能研究

国家自然科学基金

0+阅读 · 2010年12月31日

Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning

Arxiv

0+阅读 · 2022年8月24日

Fairness for AUC via Feature Augmentation

Arxiv

0+阅读 · 2022年8月24日

Demeter: A Fast and Energy-Efficient Food Profiler using Hyperdimensional Computing in Memory

Arxiv

0+阅读 · 2022年8月24日

KEEP: An Industrial Pre-Training Framework for Online Recommendation via Knowledge Extraction and Plugging

Arxiv

0+阅读 · 2022年8月22日

DAFT: Distilling Adversarially Fine-tuned Models for Better OOD Generalization

Arxiv

0+阅读 · 2022年8月19日

LogStamp: Automatic Online Log Parsing Based on Sequence Labelling

Arxiv

0+阅读 · 2022年8月10日

Updating Embeddings for Dynamic Knowledge Graphs

Arxiv

20+阅读 · 2021年9月22日

GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training

Arxiv

14+阅读 · 2021年2月16日

Explainable Recommender Systems via Resolving Learning Representations

Arxiv

13+阅读 · 2020年8月21日

Attributed Graph Clustering via Adaptive Graph Convolution

Arxiv

11+阅读 · 2019年6月4日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

专知会员服务

20+阅读 · 2020年11月13日

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

神经网络序列数据建模，229页ppt，Modeling Sequential Data with Neural Nets

专知会员服务

65+阅读 · 2020年7月25日

【KDD2020】基于知识图谱的语义融合改进会话推荐系统，Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion

【KDD2020】基于知识图谱的语义融合改进会话推荐系统，Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion

专知会员服务

88+阅读 · 2020年7月9日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

105+阅读 · 2020年6月10日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

94+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

53+阅读 · 2020年1月30日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

18+阅读 · 2019年10月22日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

56+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

81+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

64+阅读 · 2019年10月9日

热门VIP内容

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

26+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

27+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

42+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

16+阅读 · 2018年12月24日

【论文推荐】最新八篇推荐系统相关论文—可解释推荐、上下文感知推荐系统、异构知识库嵌入、深度强化学习、移动推荐系统

【论文推荐】最新八篇推荐系统相关论文—可解释推荐、上下文感知推荐系统、异构知识库嵌入、深度强化学习、移动推荐系统

专知

17+阅读 · 2018年6月16日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning

Arxiv

0+阅读 · 2022年8月24日

Fairness for AUC via Feature Augmentation

Arxiv

0+阅读 · 2022年8月24日

Demeter: A Fast and Energy-Efficient Food Profiler using Hyperdimensional Computing in Memory

Arxiv

0+阅读 · 2022年8月24日

KEEP: An Industrial Pre-Training Framework for Online Recommendation via Knowledge Extraction and Plugging

Arxiv

0+阅读 · 2022年8月22日

DAFT: Distilling Adversarially Fine-tuned Models for Better OOD Generalization

Arxiv

0+阅读 · 2022年8月19日

LogStamp: Automatic Online Log Parsing Based on Sequence Labelling

Arxiv

0+阅读 · 2022年8月10日

Updating Embeddings for Dynamic Knowledge Graphs

Arxiv

20+阅读 · 2021年9月22日

GraphNorm: A Principled Approach to Accelerating Graph Neural Network Training

Arxiv

14+阅读 · 2021年2月16日

Explainable Recommender Systems via Resolving Learning Representations

Arxiv

13+阅读 · 2020年8月21日

Attributed Graph Clustering via Adaptive Graph Convolution

Arxiv

11+阅读 · 2019年6月4日

相关基金

大岩桐叶序调控相关基因的克隆及功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

抗性小麦应答孢囊线虫侵染的ROS相关基因的分离与功能分析

国家自然科学基金

0+阅读 · 2013年12月31日

基于压电驱动的多自由度微操作器精密定位及振动抑振一体化控制理论与方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

citron kinase促进HIV-1病毒颗粒包装出芽机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

免疫相关基因与慢性非细菌性前列腺炎的关联分析及关键靶点的生物功能研究

国家自然科学基金

0+阅读 · 2012年12月31日

关于图上随机游走、渗流的几个问题

国家自然科学基金

0+阅读 · 2012年12月31日

玉米大斑病菌水甘油通道蛋白StFps1基因的克隆与功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

蜡样芽孢杆菌Bacillus cereus 905锰超氧化物歧化酶（MnSOD）基因表达调控途径的研究

国家自然科学基金

0+阅读 · 2011年12月31日

福氏志贺氏菌HtrA蛋白功能研究

国家自然科学基金

0+阅读 · 2011年12月31日

西班牙流感病毒膜蛋白PB1-F2的结构及功能研究

国家自然科学基金

0+阅读 · 2010年12月31日

微信扫码咨询专知VIP会员