Federated learning has attracted much research attention due to its privacy protection in distributed machine learning. However, existing work of federated learning mainly focuses on Convolutional Neural Network (CNN), which cannot efficiently handle graph data that are popular in many applications. Graph Convolutional Network (GCN) has been proposed as one of the most promising techniques for graph learning, but its federated setting has been seldom explored. In this paper, we propose FedGraph for federated graph learning among multiple computing clients, each of which holds a subgraph. FedGraph provides strong graph learning capability across clients by addressing two unique challenges. First, traditional GCN training needs feature data sharing among clients, leading to risk of privacy leakage. FedGraph solves this issue using a novel cross-client convolution operation. The second challenge is high GCN training overhead incurred by large graph size. We propose an intelligent graph sampling algorithm based on deep reinforcement learning, which can automatically converge to the optimal sampling policies that balance training speed and accuracy. We implement FedGraph based on PyTorch and deploy it on a testbed for performance evaluation. The experimental results of four popular datasets demonstrate that FedGraph significantly outperforms existing work by enabling faster convergence to higher accuracy.
翻译:联邦学习因其在分布式机器学习中的隐私保护而吸引了大量的研究关注,然而,联邦学习的现有工作主要侧重于进化神经网络(CNN),后者无法有效地处理许多应用中流行的图表数据。图表革命网络(GCN)被提议为最有希望的图表学习技术之一,但其联合会背景很少探索。我们在此文件中提议Fed Grph,供多个计算机客户使用联邦化图表学习,每个客户都有一份分图。Fed Graph通过应对两个独特的挑战,为客户提供很强的图表学习能力。首先,传统的GCN培训需要将客户共享数据,从而导致隐私流失的风险。Fed Graph利用新的跨客户组合操作解决这一问题。第二个挑战是高水平的GCN培训间接费用,因为其大图形规模。我们提议基于深度强化学习的智能图形抽样算法,它可以自动与平衡培训速度和准确性的最佳抽样政策相融合。我们根据PyTorrch实施FedGraph,并将其置于业绩评估的更高测试台上。四个大众数据组合的实验结果显示,使FDGraph能够快速地使现有数据精确化。