Graph-based learning is a rapidly growing sub-field of machine learning with applications in social networks, citation networks, and bioinformatics. One of the most popular models is graph attention networks. They were introduced to allow a node to aggregate information from the features of neighbor nodes in a non-uniform way, in contrast to simple graph convolution, which does not distinguish the neighbors of a node. In this paper, we theoretically study this expected behaviour of graph attention networks. We prove multiple results on the performance of the graph attention mechanism for the problem of node classification on a contextual stochastic block model. Here the node features are obtained from a mixture of Gaussians and the edges from a stochastic block model. We show that in an "easy" regime, where the distance between the means of the Gaussians is large enough, graph attention is able to distinguish inter-class from intra-class edges, and thus it maintains the weights of important edges and significantly reduces the weights of unimportant edges. We then show that this implies perfect node classification. In the "hard" regime, we show that every attention mechanism fails to distinguish intra-class from inter-class edges. We evaluate our theoretical results on synthetic and real-world data.
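To make the data model concrete, the following is a minimal sketch of sampling from a two-class contextual stochastic block model as described above: node features drawn from a mixture of two Gaussians with opposite means, and edges drawn from a stochastic block model. The function name `sample_csbm` and the parameter names (intra-class probability `p`, inter-class probability `q`, mean direction `mu`, noise level `sigma`) are illustrative assumptions, not notation taken from the paper.

```python
import numpy as np

def sample_csbm(n, d, p, q, mu, sigma, rng=None):
    """Sample a two-class contextual stochastic block model (illustrative sketch).

    n: number of nodes, d: feature dimension,
    p: intra-class edge probability, q: inter-class edge probability,
    mu: mean direction (class means are +mu and -mu), sigma: feature noise level.
    """
    rng = np.random.default_rng(rng)
    # Class labels: first half class 0, second half class 1.
    y = np.repeat([0, 1], [n // 2, n - n // 2])
    # Node features: Gaussian mixture with means +mu and -mu.
    means = np.where(y[:, None] == 0, mu, -mu)
    X = means + sigma * rng.standard_normal((n, d))
    # Edges: stochastic block model, probability p within a class and q across classes.
    same_class = (y[:, None] == y[None, :])
    probs = np.where(same_class, p, q)
    upper = np.triu(rng.random((n, n)) < probs, k=1)
    A = upper | upper.T  # symmetric adjacency matrix, no self-loops
    return X, A, y

# Example of an "easy"-style setting: well-separated means, small noise.
mu = np.ones(8)
X, A, y = sample_csbm(n=200, d=8, p=0.5, q=0.1, mu=mu, sigma=0.1, rng=0)
```

In this sketch, increasing the norm of `mu` relative to `sigma` corresponds to the "easy" regime where the Gaussian means are far apart, while shrinking that gap corresponds to the "hard" regime studied in the paper.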