Graph Neural Networks (GNNs) have received much attention in the graph deep learning domain. However, recent research empirically and theoretically shows that deep GNNs suffer from over-fitting and over-smoothing problems. The usual solutions either cannot solve extensive runtime of deep GNNs or restrict graph convolution in the same feature space. We propose the Adaptive Graph Diffusion Networks (AGDNs) which perform multi-layer generalized graph diffusion in different feature spaces with moderate complexity and runtime. Standard graph diffusion methods combine large and dense powers of the transition matrix with predefined weighting coefficients. Instead, AGDNs combine smaller multi-hop node representations with learnable and generalized weighting coefficients. We propose two scalable mechanisms of weighting coefficients to capture multi-hop information: Hop-wise Attention (HA) and Hop-wise Convolution (HC). We evaluate AGDNs on diverse, challenging Open Graph Benchmark (OGB) datasets with semi-supervised node classification and link prediction tasks. Until the date of submission (Aug 26, 2022), AGDNs achieve top-1 performance on the ogbn-arxiv, ogbn-proteins and ogbl-ddi datasets and top-3 performance on the ogbl-citation2 dataset. On the similar Tesla V100 GPU cards, AGDNs outperform Reversible GNNs (RevGNNs) with 13% complexity and 1% training runtime of RevGNNs on the ogbn-proteins dataset. AGDNs also achieve comparable performance to SEAL with 36% training and 0.2% inference runtime of SEAL on the ogbl-citation2 dataset.
翻译:内建图网络(GNN)在深深的图形学习领域受到了很多关注。然而,最近的实验和理论研究表明,深GNNN受到超装和超制问题的影响。通常的解决办法要么无法解决深GNNN的广泛运行时间,要么无法在同一功能空间限制图图变动。我们建议采用适应性图形扩散网络(AGDN),在不同特点空间进行多层通用图传播,具有中等复杂性和运行时间。标准图传播方法将过渡矩阵的大型和密集能力与预先界定的加权系数结合起来。相反,AGDNNN将较小的多错位节点表示与可学习和通用的加权系数相结合。我们建议采用两种可升级的加权系数机制,以获取多点信息:超点关注(HAHA)和高点传动(HC)。我们用半超超标的节点分类和链接预测任务,直到提交日期(Aug 2022),AGDNBSO培训与SOB的顶级数据运行时间(GNO-B),也实现最高性GODOD(OD%GOD)的运行和SODODOD的运行数据。