We describe a method to select the nodes in Graph datasets for training so that the model trained on the points selected will be be better than the ones if we select other points for the purpose of training. This is a very important aspect as the process of labelling the points is often a costly affair. The usual Active Learning methods are good but the penalty involved with these methods is that, we need to re-train the model after selecting the nodes in each iteration of Active Learning cycle. We come up with a method which use the concept of Graph Centrality to select the nodes for labeling and training initially and the training is needed to perform only once. We have tested this idea on three graph datasets - Cora, Citeseer and Pubmed- and the results are really encouraging.
翻译:我们描述在图表数据集中选择用于培训的节点的方法,这样,如果我们为培训目的选择其他点,在选定的点上培训的模型比那些点上培训的模型要好。这是一个非常重要的方面,因为标记点的过程往往费用高昂。通常的主动学习方法是好的,但与这些方法有关的惩罚是,我们需要在选择主动学习周期的每个迭代节点后再对模型进行再培训。我们想出一种方法,用图形中心概念来选择最初标签和培训的节点,而培训只需要一次。我们已经在三个图表数据集(Cora、Citeser和Pubmed)上测试了这个想法,结果非常令人鼓舞。