Graph Neural Networks (GNNs) for prediction tasks like node classification or edge prediction have received increasing attention in recent machine learning from graphically structured data. However, a large quantity of labeled graphs is difficult to obtain, which significantly limits the true success of GNNs. Although active learning has been widely studied for addressing label-sparse issues with other data types like text, images, etc., how to make it effective over graphs is an open question for research. In this paper, we present an investigation on active learning with GNNs for node classification tasks. Specifically, we propose a new method, which uses node feature propagation followed by K-Medoids clustering of the nodes for instance selection in active learning. With a theoretical bound analysis we justify the design choice of our approach. In our experiments on four benchmark datasets, the proposed method outperforms other representative baseline methods consistently and significantly.
翻译:用于诸如节点分类或边缘预测等预测任务的“神经网络”图(GNNs)在最近机器从图形结构化数据中学习的过程中日益受到越来越多的关注。然而,大量贴标签的图表很难获得,这大大限制了GNNs的真正成功。虽然已经广泛研究了如何与其他类型的数据(如文本、图像等)一起解决标签偏差问题的积极学习,但如何使其有效超越图表是一个有待研究的问题。在本文件中,我们介绍了关于与GNS一起积极学习用于节点分类任务的调查。具体地说,我们提出了一种新的方法,在积极学习中,采用节点的节点的节点组合为节点进行节点的节点传播,然后采用K-Medoids群集作为实例选择。通过理论约束性分析,我们有理由选择我们的方法。在四个基准数据集的实验中,拟议的方法与其他有代表性的基准方法一致和显著地比其他基准方法。