Consistency training is a popular method to improve deep learning models in computer vision and natural language processing. Graph neural networks (GNNs) have achieved remarkable performance in a variety of network science learning tasks, but to date no work has studied the effect of consistency training on large-scale graph problems. GNNs scale to large graphs through minibatch training and subsample node neighborhoods to cope with high-degree nodes. We exploit the randomness inherent in neighbor subsampling and introduce a novel consistency training method to improve accuracy. For a target node we generate different neighborhood expansions and distill the knowledge of their averaged prediction into the GNN. Our method approximates the expected prediction over the possible neighborhood samples and, in practice, requires only a few samples. We demonstrate that our training method outperforms standard GNN training in several different settings, and yields the largest gains when label rates are low.
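A minimal sketch of the consistency objective described above, not the authors' exact implementation: for each target node we draw several random neighborhood expansions, average the resulting predictions, and distill that average back into each sampled view. The names `model`, `graph`, `sample_neighborhood`, `num_samples`, and `temperature` are illustrative assumptions; in particular, `sample_neighborhood(graph, node_ids)` is assumed to perform GraphSAGE-style fan-out sampling and `model` to return per-node logits for the sampled expansion.

```python
import torch
import torch.nn.functional as F


def consistency_loss(model, graph, node_ids, num_samples=3, temperature=1.0):
    """Consistency regularizer built from averaged predictions over several
    randomly subsampled neighborhoods of the same target nodes (sketch)."""
    # One forward pass per independently sampled neighborhood expansion.
    logits = [model(sample_neighborhood(graph, node_ids)) for _ in range(num_samples)]
    probs = [F.softmax(l / temperature, dim=-1) for l in logits]

    # The averaged prediction approximates the expectation over neighborhood samples.
    mean_prob = torch.stack(probs, dim=0).mean(dim=0).detach()

    # Distill the averaged ("teacher") prediction back into each sampled view.
    loss = sum(
        F.kl_div(F.log_softmax(l, dim=-1), mean_prob, reduction="batchmean")
        for l in logits
    ) / num_samples
    return loss
```

In such a setup, this term would typically be added to the supervised cross-entropy on labeled nodes with a weighting hyperparameter, so that unlabeled nodes can also contribute through the consistency signal, which is consistent with the reported gains at low label rates.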