Graph neural networks (GNNs) have been demonstrated as a powerful tool for analysing non-Euclidean graph data. However, the lack of efficient distributed graph learning (GL) systems severely hinders applications of GNNs, especially when graphs are big and GNNs are relatively deep. Herein, we present GraphTheta, a novel distributed and scalable GL system implemented in vertex-centric graph programming model. GraphTheta is the first GL system built upon distributed graph processing with neural network operators implemented as user-defined functions. This system supports multiple training strategies, and enables efficient and scalable big graph learning on distributed (virtual) machines with low memory each. To facilitate graph convolution implementations, GraphTheta puts forward a new GL abstraction named NN-TGAR to bridge the gap between graph processing and graph deep learning. A distributed graph engine is proposed to conduct the stochastic gradient descent optimization with a hybrid-parallel execution. Moreover, we add support for a new cluster-batched training strategy besides global-batch and mini-batch. We evaluate GraphTheta using a number of datasets with network size ranging from small-, modest- to large-scale. Experimental results show that GraphTheta can scale well to 1,024 workers for training an in-house developed GNN on an industry-scale Alipay dataset of 1.4 billion nodes and 4.1 billion attributed edges, with a cluster of CPU virtual machines (dockers) of small memory each (5$\sim$12GB). Moreover, GraphTheta obtains comparable or better prediction results than the state-of-the-art GNN implementations, demonstrating its capability of learning GNNs as well as existing frameworks, and can outperform DistDGL by up to $2.02\times$ with better scalability. To the best of our knowledge, this work presents the largest edge-attributed GNN learning task conducted in the literature.
翻译:图表神经网络 (GNN) 已被证明是分析非 Euclide 图形数据的强大工具。 然而, 缺少高效分布式图表学习 (GL) 系统严重妨碍了 GNS 的应用, 特别是当图形大, GNN 相对深时。 我们在此介绍在顶端中心图形编程模型中实施的GreaphTheta, 一个小的分布式和可缩放的GLS系统。 GapTheta 是第一个基于分布式图表处理的GL系统, 由神经网络操作者作为用户定义的功能实施。 这个系统支持多项培训战略, 并使得在分布式( 虚拟) 图像学习( G02 GL) 系统中高效和可缩放的大图表学习( 虚拟) 。 为了促进图变色色化执行, 我们推出一个新的 GNG- TG- TG 模型模型模型模型, 以更好的数据运行方式, 以普通的 G- NOOO 数据库运行方式, 以最普通的 数字 来展示 。