EmbAssi:大型图表数据库中类似搜索的嵌入式分配费用 (EmbAssi: Embedding Assignment Costs for Similarity Search in Large Graph Databases) - 专知论文

会员服务 ·

0

图 · 可约的 · 确切的 · 相似度 · 代价函数 ·

2022 年 4 月 20 日

EmbAssi: Embedding Assignment Costs for Similarity Search in Large Graph Databases

翻译：EmbAssi:大型图表数据库中类似搜索的嵌入式分配费用

Franka Bause,Erich Schubert,Nils M. Kriege

The graph edit distance is an intuitive measure to quantify the dissimilarity of graphs, but its computation is NP-hard and challenging in practice. We introduce methods for answering nearest neighbor and range queries regarding this distance efficiently for large databases with up to millions of graphs. We build on the filter-verification paradigm, where lower and upper bounds are used to reduce the number of exact computations of the graph edit distance. Highly effective bounds for this involve solving a linear assignment problem for each graph in the database, which is prohibitive in massive datasets. Index-based approaches typically provide only weak bounds leading to high computational costs verification. In this work, we derive novel lower bounds for efficient filtering from restricted assignment problems, where the cost function is a tree metric. This special case allows embedding the costs of optimal assignments isometrically into $\ell_1$ space, rendering efficient indexing possible. We propose several lower bounds of the graph edit distance obtained from tree metrics reflecting the edit costs, which are combined for effective filtering. Our method termed EmbAssi can be integrated into existing filter-verification pipelines as a fast and effective pre-filtering step. Empirically we show that for many real-world graphs our lower bounds are already close to the exact graph edit distance, while our index construction and search scales to very large databases.

翻译：图形编辑距离是量化图表不同之处的直观尺度,但它的计算方法是硬性NP,在实践中具有挑战性。我们引入了对最近的邻居进行回答的方法,并针对具有上百万图的大型数据库高效地对距离进行范围查询。我们以过滤核查范式为基础,使用下限和上界来减少图形编辑距离的精确计算数量。为此,非常有效的界限涉及解决数据库中每个图表的线性分配问题,这在庞大的数据集中是令人窒息的。基于索引的方法通常只能提供导致计算成本高的薄弱界限。在这项工作中,我们从有限的任务问题(成本函数是树度指标)中获取高效过滤的更近距离查询。这个特殊案例允许将最佳任务的成本以直径直径方式嵌入$\ell_1美元的空间,从而可以有效地编制索引。我们从反映编辑成本的树度测量图中获得的几条更低的线。我们称为EmbAsiti的方法可以纳入现有的过滤器-核实成本高的校验路径中。我们称为Embassi的工作可以作为快速和精确的图表,同时显示我们快速和精确的深度的深度的深度的深度的深度,我们可以显示我们所建的深度的深度的深度的深度的深度的深度的深度。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

非线性Hamiltonian 系统高效谱方法及其应用

国家自然科学基金

0+阅读 · 2017年12月31日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

猪长链非编码RNA-ED1与miR-21的互作及其对骨骼肌生长发育的调控

国家自然科学基金

0+阅读 · 2015年12月31日

非负矩阵张量积保持问题的研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

家蚕miRNAs对丝胶基因Ser-1、Ser-2的转录后调控研究

国家自然科学基金

0+阅读 · 2013年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

4H碳化硅PNPN自支撑复合膜的HTCVD生长研究

国家自然科学基金

0+阅读 · 2012年12月31日

CK2磷酸化抑制 TAp73促进骨肉瘤干细胞增殖的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

活性维生素D通过调节细胞衰老发挥抗衰老和抗肿瘤双重作用的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

Arxiv

0+阅读 · 2022年6月10日

Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals

Arxiv

0+阅读 · 2022年6月10日

Boosting Search Engines with Interactive Agents

Arxiv

0+阅读 · 2022年6月7日

Decentralized Online Regularized Learning Over Random Time-Varying Graphs

Arxiv

0+阅读 · 2022年6月7日

Interest-aware Message-Passing GCN for Recommendation

Interest-aware Message-Passing GCN for Recommendation

Arxiv

12+阅读 · 2021年2月19日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding

Arxiv

12+阅读 · 2020年4月15日

Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks

Arxiv

14+阅读 · 2019年8月8日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba

Arxiv

15+阅读 · 2018年5月24日

VIP会员

文章信息

相关主题

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

不可错过！UIUC最新《统计强化学习》课程！

专知会员服务

53+阅读 · 2020年9月7日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

扩散语言模型综述

《美陆军徒步机动作战条令手册》最新168页

【博士论文】理解神经网络的训练动态：从局部优化轨迹与特征学习视角

军事后勤数字化未来展望

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

【新书发布】原作者MarcG.Bellemare发布315页分布强化学习书籍(DistributionalRL)

深度强化学习实验室

1+阅读 · 2022年1月11日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

List-Decodable Sparse Mean Estimation via Difference-of-Pairs Filtering

Arxiv

0+阅读 · 2022年6月10日

Comparison and Combination of Sentence Embeddings Derived from Different Supervision Signals

Arxiv

0+阅读 · 2022年6月10日

Boosting Search Engines with Interactive Agents

Arxiv

0+阅读 · 2022年6月7日

Decentralized Online Regularized Learning Over Random Time-Varying Graphs

Arxiv

0+阅读 · 2022年6月7日

Interest-aware Message-Passing GCN for Recommendation

Interest-aware Message-Passing GCN for Recommendation

Arxiv

12+阅读 · 2021年2月19日

Embedding-based Retrieval in Facebook Search

Arxiv

12+阅读 · 2020年6月20日

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding

Arxiv

12+阅读 · 2020年4月15日

Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks

Arxiv

14+阅读 · 2019年8月8日

From Knowledge Graph Embedding to Ontology Embedding: Region Based Representations of Relational Structures

Arxiv

10+阅读 · 2018年5月26日

Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba

Arxiv

15+阅读 · 2018年5月24日

相关基金

非线性Hamiltonian 系统高效谱方法及其应用

国家自然科学基金

0+阅读 · 2017年12月31日

Hamilton-Jacibi方程的弱KAM理论

国家自然科学基金

2+阅读 · 2017年12月31日

猪长链非编码RNA-ED1与miR-21的互作及其对骨骼肌生长发育的调控

国家自然科学基金

0+阅读 · 2015年12月31日

非负矩阵张量积保持问题的研究

国家自然科学基金

0+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

家蚕miRNAs对丝胶基因Ser-1、Ser-2的转录后调控研究

国家自然科学基金

0+阅读 · 2013年12月31日

microRNA调节肿瘤抑制因子Caliban应答DNA损伤的机制

国家自然科学基金

1+阅读 · 2012年12月31日

4H碳化硅PNPN自支撑复合膜的HTCVD生长研究

国家自然科学基金

0+阅读 · 2012年12月31日

CK2磷酸化抑制 TAp73促进骨肉瘤干细胞增殖的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

活性维生素D通过调节细胞衰老发挥抗衰老和抗肿瘤双重作用的机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员