Link prediction is central to many real-world applications, but its performance may be hampered when the graph of interest is sparse. To alleviate issues caused by sparsity, we investigate a previously overlooked phenomenon: in many cases, a densely connected, complementary graph can be found for the original graph. The denser graph may share nodes with the original graph, which offers a natural bridge for transferring meaningful knowledge. We identify this setting as Graph Intersection-induced Transfer Learning (GITL), which is motivated by practical applications in e-commerce or academic co-authorship predictions. We develop a framework to effectively leverage the structural prior in this setting. We first create an intersection subgraph using the shared nodes between the two graphs, then transfer knowledge from the source-enriched intersection subgraph to the full target graph. In the second step, we consider two approaches: a modified label propagation, and a multi-layer perceptron (MLP) model in a teacher-student regime. Experimental results on proprietary e-commerce datasets and open-source citation graphs show that the proposed workflow outperforms existing transfer learning baselines that do not explicitly utilize the intersection structure.
翻译:链接预测是许多真实世界应用的核心,但当兴趣图少见时,其性能可能会受到阻碍。为了缓解由偏僻性引起的问题,我们调查了一个以前被忽视的现象:在许多情况下,可以找到一个连接密度高、互补的原始图形。较稠密的图形可以与原始图形共享节点,这为有意义的知识转让提供了天然的桥梁。我们把这一设置确定为图表跨部门诱导的转移学习(GITL),这是在电子商务或学术共同作者预测中的实际应用所驱动的。我们开发了一个框架,以有效地利用这一设置之前的结构。我们首先利用两个图形之间的共同节点创建一个交叉子图,然后将知识从来源丰富、交叉的子图转到整个目标图中。在第二步,我们考虑两种方法:修改标签传播,在教师-学生制度中采用多层透镜模型。关于专有电子商务数据集和开放源引用图的实验结果显示,拟议的工作流程超越了未明确利用交叉结构的现有转移学习基线。</s>