Link prediction (LP) has been recognized as an important task in graph learning with its broad practical applications. A typical application of LP is to retrieve the top scoring neighbors for a given source node, such as the friend recommendation. These services desire the high inference scalability to find the top scoring neighbors from many candidate nodes at low latencies. There are two popular decoders that the recent LP models mainly use to compute the edge scores from node embeddings: the HadamardMLP and Dot Product decoders. After theoretical and empirical analysis, we find that the HadamardMLP decoders are generally more effective for LP. However, HadamardMLP lacks the scalability for retrieving top scoring neighbors on large graphs, since to the best of our knowledge, there does not exist an algorithm to retrieve the top scoring neighbors for HadamardMLP decoders in sublinear complexity. To make HadamardMLP scalable, we propose the Flashlight algorithm to accelerate the top scoring neighbor retrievals for HadamardMLP: a sublinear algorithm that progressively applies approximate maximum inner product search (MIPS) techniques with adaptively adjusted query embeddings. Empirical results show that Flashlight improves the inference speed of LP by more than 100 times on the large OGBL-CITATION2 dataset without sacrificing effectiveness. Our work paves the way for large-scale LP applications with the effective HadamardMLP decoders by greatly accelerating their inference.
翻译:链接预测( LP) 已被公认为是图表学习中的一项重要任务, 其应用范围很广 。 LP 的典型应用是检索某个源节点( 如朋友建议) 的最高评分邻居, 如朋友建议 。 这些服务希望从许多候选人节点中找到来自低延迟时间的顶评分邻居。 最近的 LP 模型有两个流行的解算器, 主要是用来计算从节点嵌入的边缘评分 : HadamardMLP 和 Dot 产品解码器 。 经过理论和经验分析, 我们发现 HadamardMLP 解码器一般对 LP 比较有效。 然而, HadamardMLMLP 缺乏在大图表中重新获取顶评分邻居的可缩放性。 因为据我们所知, 在亚行深度嵌入中, HadmardMLP decoder, 我们建议闪光算法加速为 HadmardMLP 的顶端评分检索器 。 HadamarP pline dal dal dal dal dal dal dal dal 方法, 以不断 快速地将我们的缩算法升级化的大幅升级化为升级 。