Learning unbiased node representations for imbalanced samples in the graph has become a more remarkable and important topic. For the graph, a significant challenge is that the topological properties of the nodes (e.g., locations, roles) are unbalanced (topology-imbalance), other than the number of training labeled nodes (quantity-imbalance). Existing studies on topology-imbalance focus on the location or the local neighborhood structure of nodes, ignoring the global underlying hierarchical properties of the graph, i.e., hierarchy. In the real-world scenario, the hierarchical structure of graph data reveals important topological properties of graphs and is relevant to a wide range of applications. We find that training labeled nodes with different hierarchical properties have a significant impact on the node classification tasks and confirm it in our experiments. It is well known that hyperbolic geometry has a unique advantage in representing the hierarchical structure of graphs. Therefore, we attempt to explore the hierarchy-imbalance issue for node classification of graph neural networks with a novelty perspective of hyperbolic geometry, including its characteristics and causes. Then, we propose a novel hyperbolic geometric hierarchy-imbalance learning framework, named HyperIMBA, to alleviate the hierarchy-imbalance issue caused by uneven hierarchy-levels and cross-hierarchy connectivity patterns of labeled nodes.Extensive experimental results demonstrate the superior effectiveness of HyperIMBA for hierarchy-imbalance node classification tasks.
翻译:学习不偏见的节点表示以应对图中不平衡样本已经成为一个越来越重要的话题。在网络图中,一个重要的挑战是,节点的拓扑特性(例如位置、角色)是不平衡的(拓扑不平衡),而不仅仅是具有标签的训练节点数量上的不平衡。针对拓扑失衡问题的现有研究侧重于节点的位置或局部邻域结构,忽略了图的全局底层分层结构,即层次结构。在现实场景中,图数据的层次结构揭示了图的重要拓扑特性,并与多种应用相关。我们发现,具有不同层次属性的标记训练节点对节点分类任务有重大影响,并在实验中进行了确认。众所周知,双曲几何在表示图的层次结构方面具有独特优势。因此,我们尝试从超几何学的新颖角度,包括其特点和成因,探索网络图神经网络层次不平衡问题。然后,我们提出了一种新颖的超几何层次不平衡学习框架,称为HyperIMBA,以缓解由标记节点不平衡层次级别和跨层次连接模式引起的层次不平衡问题。广泛的实验结果验证了HyperIMBA对于层次不平衡节点分类任务的优越有效性。