Graph Convolutional Networks (GCNs) have been widely applied in various fields due to their significant power on processing graph-structured data. Typical GCN and its variants work under a homophily assumption (i.e., nodes with same class are prone to connect to each other), while ignoring the heterophily which exists in many real-world networks (i.e., nodes with different classes tend to form edges). Existing methods deal with heterophily by mainly aggregating higher-order neighborhoods or combing the immediate representations, which leads to noise and irrelevant information in the result. But these methods did not change the propagation mechanism which works under homophily assumption (that is a fundamental part of GCNs). This makes it difficult to distinguish the representation of nodes from different classes. To address this problem, in this paper we design a novel propagation mechanism, which can automatically change the propagation and aggregation process according to homophily or heterophily between node pairs. To adaptively learn the propagation process, we introduce two measurements of homophily degree between node pairs, which is learned based on topological and attribute information, respectively. Then we incorporate the learnable homophily degree into the graph convolution framework, which is trained in an end-to-end schema, enabling it to go beyond the assumption of homophily. More importantly, we theoretically prove that our model can constrain the similarity of representations between nodes according to their homophily degree. Experiments on seven real-world datasets demonstrate that this new approach outperforms the state-of-the-art methods under heterophily or low homophily, and gains competitive performance under homophily.
翻译:GCN及其变体在同一个假设下工作(即,同一类的节点容易相互连接),而忽视许多真实世界网络中存在的偏差性(即,不同类的节点往往形成边缘)。现有方法与不同领域格格不入,主要汇集较高级的邻里或梳理直接表达方式,结果导致噪音和不相干的信息。但这些方法并没有改变在同级假设下运作的传播机制(即,同一类的节点容易相互连接),而忽视许多真实世界网络中存在的偏差性(即,不同类的节点往往形成边缘 ) 。 现有方法与不同领域格格格不入,主要通过聚集较高级的邻里区或梳理直接代表方式,从而导致结果产生噪音和不相干的信息。但是这些方法并没有改变在同级假设下运作的传播机制(这是GCNs的一个基本部分 ) 。为了解决这个问题,我们设计了一个新的传播机制,它可以自动改变传播和汇总过程,在交替模式中,我们从正调的相互比较的角度,我们用两种程度来测量不同等级的等级的等级的等级,我们之间的等级,在上学习了更接近的正态的正态的正变的内,然后将数据转化为的顺序中学习了。