Graph processing has become an important part of various areas of computing, including machine learning, medical applications, social network analysis, computational sciences, and others. A growing amount of the associated graph processing workloads are dynamic, with millions of edges added or removed per second. Graph streaming frameworks are specifically crafted to enable the processing of such highly dynamic workloads. Recent years have seen the development of many such frameworks. However, they differ in their general architectures (with key details such as the support for the concurrent execution of graph updates and queries, or the incorporated graph data organization), the types of updates and workloads allowed, and many others. To facilitate the understanding of this growing field, we provide the first analysis and taxonomy of dynamic and streaming graph processing. We focus on identifying the fundamental system designs and on understanding their support for concurrency, and for different graph updates as well as analytics workloads. We also crystallize the meaning of different concepts associated with streaming graph processing, such as dynamic, temporal, online, and time-evolving graphs, edge-centric processing, models for the maintenance of updates, and graph databases. Moreover, we provide a bridge with the very rich landscape of graph streaming theory by giving a broad overview of recent theoretical related advances, and by discussing which graph streaming models and settings could be helpful in developing more powerful streaming frameworks and designs. We also outline graph streaming workloads and research challenges.
翻译:图表处理已成为各种计算领域的一个重要部分,包括机器学习、医疗应用、社会网络分析、计算科学等。越来越多的相关图表处理工作量是动态的,每秒增加或删除数以百万计的边缘。图表流框架是专门设计的,以便能够处理如此高度动态的工作量。近年来,许多此类框架的发展也各不相同。但是,它们的总体结构也各不相同(关键细节,例如支持同时进行图表更新和查询,或合并的图表数据组织)、允许的更新和工作量类型以及其他许多方面。为了便于了解这个不断增长的领域,我们首次对动态和流动图处理进行分析和分类。我们侧重于确定基本系统设计,并了解其对相通性、不同图表更新以及分析工作量的支持。我们还明确了与流式图表处理相关的不同概念的含义,如动态、时间、在线和时间变化式图表处理,允许的更新和工作量类型,以及许多其他方面。为了便于理解这个领域,我们提供了动态和流图处理的首次分析和分类。我们侧重于确定基本系统设计,我们通过构建一个与图表相关的富有的图表结构结构,我们通过一个更强有力的图表化的理论流和图表化的流程结构结构结构来讨论一个更紧密的模型,我们通过开发一个与富有的图表化的图表的图表结构结构结构结构进行。