Heterogeneous graphs, which contain nodes and edges of multiple types, are prevalent in various domains, including bibliographic networks, social media, and knowledge graphs. As a fundamental task in analyzing heterogeneous graphs, relevance measure aims to calculate the relevance between two objects of different types, which has been used in many applications such as web search, recommendation, and community detection. Most of existing relevance measures focus on homogeneous networks where objects are of the same type, and a few measures are developed for heterogeneous graphs, but they often need the pre-defined meta-path. Defining meaningful meta-paths requires much domain knowledge, which largely limits their applications, especially on schema-rich heterogeneous graphs like knowledge graphs. Recently, the Graph Neural Network (GNN) has been widely applied in many graph mining tasks, but it has not been applied for measuring relevance yet. To address the aforementioned problems, we propose a novel GNN-based relevance measure, namely GSim. Specifically, we first theoretically analyze and show that GNN is effective for measuring the relevance of nodes in the graph. We then propose a context path-based graph neural network (CP-GNN) to automatically leverage the semantics in heterogeneous graphs. Moreover, we exploit CP-GNN to support relevance measures between two objects of any type. Extensive experiments demonstrate that GSim outperforms existing measures.
翻译:含有多种类型节点和边缘的异质图形在包括书目网络、社交媒体和知识图等不同领域十分普遍。作为分析多元图的基本任务,相关性计量旨在计算两种不同对象之间的关联性,这两类不同对象在网络搜索、建议和社区检测等许多应用中都曾使用过。大部分现有相关计量措施侧重于同一类型物体的同质网络,为异质图制定了一些计量标准,但它们往往需要预先界定的元路径。定义有意义的元路径需要许多域知识,这在很大程度上限制了它们的应用,特别是模型丰富多彩的多元图,如知识图。最近,图形神经网络(GNNN)被广泛应用于许多图形采矿任务,但尚未用于衡量相关性。为了解决上述问题,我们提出了一个新的基于GNNN的关联性计量标准,即GSim。具体地说,我们首先从理论上分析并表明GNNN能够有效地测量图中点的关联性。我们然后提议一个基于环境路径的星座图目标网络,特别是精密的图质图象模型,即我们GNNNNM的模型模型,用以展示S-gmeximal 之间的任何Syalalalal 。