Graph data management (also called NoSQL) has revealed beneficial characteristics in terms of flexibility and scalability by differently balancing between query expressivity and schema flexibility. This peculiar advantage has resulted into an unforeseen race of developing new task-specific graph systems, query languages and data models, such as property graphs, key-value, wide column, resource description framework (RDF), etc. Present-day graph query languages are focused towards flexible graph pattern matching (aka sub-graph matching), whereas graph computing frameworks aim towards providing fast parallel (distributed) execution of instructions. The consequence of this rapid growth in the variety of graph-based data management systems has resulted in a lack of standardization. Gremlin, a graph traversal language, and machine provides a common platform for supporting any graph computing system (such as an OLTP graph database or OLAP graph processors). We present a formalization of graph pattern matching for Gremlin queries. We also study, discuss and consolidate various existing graph algebra operators into an integrated graph algebra.
翻译:图表数据管理(又称 NoSQL)通过在查询表达性和系统灵活度之间的不同平衡,揭示了灵活度和可缩放性方面的有利特点,这一特殊优势导致开发新的特定任务的图表系统、查询语言和数据模型,如属性图、关键值、宽柱、资源说明框架(RDF)等,出现意外的竞赛。 当今的图表查询语言侧重于灵活的图表模式匹配(aka子图匹配),而图表计算框架旨在提供快速平行(分布)执行指令。基于图表的数据管理系统的迅速增长导致缺乏标准化。Gremlin, 图形穿行语言和机器为支持任何图表计算系统(如OLTP图形数据库或OLAP图形处理器)提供了一个共同平台。我们为Gremlin查询提供了图表模式匹配的正式化。我们还研究、讨论和整合了各种现有的图形等值操作器,并将其合并成一个综合的图表代数。