Recently, several works have studied the problem of view selection in graph databases. However, existing methods cannot fully exploit the graph properties of views, e.g., supergraph views and common subgraph views, which leads to a low view utility and duplicate view content. To address the problem, we propose an end-to-end graph view selection tool, G-View, which can judiciously generate a view set from a query workload by exploring the graph properties of candidate views and considering their efficacy. Specifically, given a graph query set and a space budget, G-View translates each query to a candidate view pattern and checks the query containment via a filtering-and-verification framework. G-View then selects the views using a graph gene algorithm (GGA), which relies on a three-phase framework that explores graph view transformations to reduce the view space and optimize the view benefit. Finally, G-View generates the extended graph views that persist all the edge-induced subgraphs to answer the subgraph and supergraph queries simultaneously. Extensive experiments on real-life and synthetic datasets demonstrated G-View achieved averagely 21x and 2x query performance speedup over two view-based methods while having 2x and 5x smaller space overhead, respectively. Moreover, the proposed selection algorithm, GGA, outperformed other selection methods in both effectiveness and efficiency.
翻译:最近,有几项工作研究了图表数据库中的视图选择问题,然而,现有方法无法充分利用观点的图形属性,例如,超光谱视图和共同子集观点,从而导致低视图效用和重复视图内容。为了解决这个问题,我们提议了一个端到端图形选择工具G-View,它可以明智地从查询工作量中生成一个视图,通过探索候选人观点的图形属性和考虑其效力,从而通过探索候选人观点的图表属性并同时考虑其效力,从而从查询工作量中生成一个视图。具体地说,根据一个图表查询数据集和空间预算,G-View将每个查询转换成候选人视图模式,并通过过滤和核实框架检查查询封存。G-View然后使用图表基因算法(GGA)选择这些视图。G-View 使用一个三阶段框架来探索图形视图转换,以缩小视图空间空间空间空间和优化视图效益。最后,G-View生成了长的图表视图视图视图,同时回答子图和超光谱查询。关于真实生活和合成数据集的广泛实验展示了平均21x和2x所实现的G-Vify 和2x 快速浏览选择方法,另外两个系统选择了GAx。