Entity alignment is a crucial task in knowledge graph fusion. However, most entity alignment approaches have the scalability problem. Recent methods address this issue by dividing large KGs into small blocks for embedding and alignment learning in each. However, such a partitioning and learning process results in an excessive loss of structure and alignment. Therefore, in this work, we propose a scalable GNN-based entity alignment approach to reduce the structure and alignment loss from three perspectives. First, we propose a centrality-based subgraph generation algorithm to recall some landmark entities serving as the bridges between different subgraphs. Second, we introduce self-supervised entity reconstruction to recover entity representations from incomplete neighborhood subgraphs, and design cross-subgraph negative sampling to incorporate entities from other subgraphs in alignment learning. Third, during the inference process, we merge the embeddings of subgraphs to make a single space for alignment search. Experimental results on the benchmark OpenEA dataset and the proposed large DBpedia1M dataset verify the effectiveness of our approach.
翻译:然而,大多数实体调整办法都存在可缩放问题。最近采用的方法是将大型KG分成小块,在每一个单元中进行嵌入和校准学习。然而,这种分割和学习过程导致结构和校准的过度损失。因此,在这项工作中,我们建议采用可缩放的GNN实体调整办法,从三个角度减少结构和校准损失。首先,我们提议采用基于核心的子图生成算法,以回顾作为不同子集之间桥梁的一些里程碑实体。第二,我们采用自我监督的实体重建,从不完整的邻里子集中恢复实体的表示,并设计跨次抽样负面抽样,以便将其他子集体的实体纳入校准学习。第三,在推断过程中,我们将子图的嵌入结合起来,为校准搜索提供一个单一的空间。关于OpenEA数据集的基准和拟议的大型DBpedia1M数据集的实验结果,可以验证我们的方法的有效性。