Graph database query languages feature expressive, yet computationally expensive pattern matching capabilities. Answering optional query clauses in SPARQL for instance renders the query evaluation problem immediately Pspace-complete. Therefore, light-weight graph pattern matching relations, such as simulation, have recently been investigated as promising alternatives to more expensive query mechanisms like, e.g., computing subgraph isomorphism. Still, graph pattern matching alone lacks expressive query capabilities: all patterns are combined by usual join constructs, where more sophisticated capabilities would be inevitable for making solutions useful to emerging applications. In this paper we bridge this gap by introducing a new dual simulation process operating on SPARQL queries. In addition to supporting the full syntactic structure of SPARQL queries, it features polynomial-time pattern matching to compute an overapproximation of query results from the database. Moreover, to achieve runtimes competing with state-of-the-art database systems, we develop a novel algorithmic solution to dual simulation graph pattern matching, based on a system of inequalities that allows for several optimization heuristics. Finally, we achieve soundness of our process for SPARQL queries including UNION, AND and OPTIONAL operators not restricted to well-designed patterns. Our experiments on synthetic and real-world graph data promise a clear gain for graph database systems when incorporating the new dual simulation techniques. In this supplement paper we present in detail all proofs, discussions of experimental results, and complexity analysis for the original paper "Fast Dual Simulation Processing of Graph Database Queries" included in the Proceedings of the 35th IEEE International Conference on Data Engineering (ICDE 2019), Macau, China.
翻译:图形数据库中的语言具有显眼性, 但却计算出昂贵的模式匹配能力。 例如, 在 SPARQL 中, 回答选题查询条款使查询评估问题立即完成。 因此, 模拟等轻量图形匹配关系最近被调查为较昂贵的查询机制的有希望的替代方法, 例如, 计算子图谱是形态学。 然而, 光图匹配本身缺乏显示查询能力: 所有的图案都是由通常的组合结构结合的, 使解决方案对新兴应用程序有用, 由此必然产生更先进的能力。 在本文中, 我们通过在 SPARQL 查询中引入一个新的双向模拟程序来弥补这一差距。 最后, 除了支持 SPARQL 查询的完整合成结构, 模拟, 它的光量图形匹配模式匹配, 来计算数据库查询结果的过度匹配。 此外, 为了实现与最先进的数据库系统竞争, 我们开发了一种新的模拟图案模型匹配的新算法解决方案, 利用一种不平等系统, 来优化双向图像处理。 最后, 我们实现了对SPARQ 的精确模型分析过程的精确分析,, 我们的精确的精确的模拟模型中, 包括了我们对SPARILL 数据库中的所有的精确分析过程,, 的精确分析, 在SLILILLLLLLLL 的精确分析, 在SLLLL 的精确的精确的精确的精确的精确的精确的模型中, 数据库中,, 我们的精确的精确的精确的计算中, 。