Graph Neural Networks (GNNs) have shown advantages in various graph-based applications. Most existing GNNs assume strong homophily of graph structure and apply permutation-invariant local aggregation of neighbors to learn a representation for each node. However, they fail to generalize to heterophilic graphs, where most neighboring nodes have different labels or features, and the relevant nodes are distant. Few recent studies attempt to address this problem by combining multiple hops of hidden representations of central nodes (i.e., multi-hop-based approaches) or sorting the neighboring nodes based on attention scores (i.e., ranking-based approaches). As a result, these approaches have some apparent limitations. On the one hand, multi-hop-based approaches do not explicitly distinguish relevant nodes from a large number of multi-hop neighborhoods, leading to a severe over-smoothing problem. On the other hand, ranking-based models do not joint-optimize node ranking with end tasks and result in sub-optimal solutions. In this work, we present Graph Pointer Neural Networks (GPNN) to tackle the challenges mentioned above. We leverage a pointer network to select the most relevant nodes from a large amount of multi-hop neighborhoods, which constructs an ordered sequence according to the relationship with the central node. 1D convolution is then applied to extract high-level features from the node sequence. The pointer-network-based ranker in GPNN is joint-optimized with other parts in an end-to-end manner. Extensive experiments are conducted on six public node classification datasets with heterophilic graphs. The results show that GPNN significantly improves the classification performance of state-of-the-art methods. In addition, analyses also reveal the privilege of the proposed GPNN in filtering out irrelevant neighbors and reducing over-smoothing.
翻译:内建网络( GNN) 在许多基于图形的应用程序中显示出优势 。 多数现有的 GNN 假设图形结构的相似性强, 并应用本地邻居的极异集合来学习每个节点的演示。 但是, 它们没有将相关节点与大量多点区块的超正统图形进行概括化, 导致严重的超偏差问题 。 另一方面, 基于排序的模型不会在中央节点( 即, 多点- 多点- 多点- 多点- 多点- 多点- 网络) 的多点隐形表达, 或者根据关注分( e. 以排名为基础的方法) 排序相邻节点。 因此, 这些方法有一些明显的局限性。 一方面, 多点- 基点- 方法并没有将相关节点与大量多点的多点( 多点) 相交错点 。 基于排序的模型不会与最终任务进行联合优化的节点排序, 并导致次优化的解决方案 。 在这项工作中, 我们使用最深点- 内建的轨道 网络 将显示一个高点 。