Graph transformer networks (GTN) are a variant of graph convolutional networks (GCN) that are targeted to heterogeneous graphs in which nodes and edges have associated type information that can be exploited to improve inference accuracy. GTNs learn important metapaths in the graph, create weighted edges for these metapaths, and use the resulting graph in a GCN. Currently, the only available implementation of GTNs uses dense matrix multiplication to find metapaths. Unfortunately, the space overhead of this approach can be large, so in practice it is used only for small graphs. In addition, the matrix-based implementation is not fine-grained enough to use random-walk based methods to optimize metapath finding. In this paper, we present a graph-based formulation and implementation of the GTN metapath finding problem. This graph-based formulation has two advantages over the matrix-based approach. First, it is more space efficient than the original GTN implementation and more compute-efficient for metapath sizes of practical interest. Second, it permits us to implement a sampling method that reduces the number of metapaths that must be enumerated, allowing the implementation to be used for larger graphs and larger metapath sizes. Experimental results show that our implementation is $6.5\times$ faster than the original GTN implementation on average for a metapath length of 4, and our sampling implementation is $155\times$ faster on average than this implementation without compromising on the accuracy of the GTN.
翻译:图表变形器网络(GTN)是图形变形网络的变体,其对象为结点和边缘具有相关类型信息,可以用来提高推断准确性。GTN在图形中学习重要的元路径,为这些元路径创建加权边緣,并在GCN中使用由此形成的图。目前,只有GTN的可用实施方使用密集矩阵乘法寻找元路。不幸的是,这一方法的空间间接成本可能很大,因此实际上它只用于小图表。此外,基于矩阵的执行不够精细,无法使用随机行走方法优化元路发现。在本文件中,我们用图表来提出GTN的元路径发现问题。这种基于图形的配方比基于矩阵的方法有两种优势。首先,它比最初的GTN执行效率更高,而且对于符合实际兴趣的元流体大小来说,它允许我们用一个抽样方法来减少基路径的准确度数量,而不是以随机行进法为基础的方法,在原位图上,使GMDM的执行率比原值要大,在GMDMD上显示,在6的执行率上,使用比原的执行速度要快。