Link prediction -- the process of uncovering missing links in a complex network -- is an important problem in information sciences, with applications ranging from social sciences to molecular biology. Recent advances in neural graph embeddings have proposed an end-to-end way of learning latent vector representations of nodes, with successful application in link prediction tasks. Yet, our understanding of the internal mechanisms of such approaches has been rather limited, and only very recently we have witnessed the development of a very compelling connection to the mature matrix factorization theory. In this work, we make an important contribution to our understanding of the interplay between the skip-gram powered neural graph embedding algorithms and the matrix factorization via SVD. In particular, we show that the link prediction accuracy of graph embeddings strongly depends on the transformations of the original graph co-occurrence matrix that they decompose, sometimes resulting in staggering boosts of accuracy performance on link prediction tasks. Our improved approach to learning low-rank factorization embeddings that incorporate information from unlikely pairs of nodes yields results on par with the state-of-the-art link prediction performance achieved by a complex neural graph embedding model
翻译:链接预测 -- -- 发现复杂网络中缺失的环节的过程 -- -- 是信息科学中的一个重要问题,从社会科学到分子生物学等各种应用。神经图嵌入的最近进展提出了一种端到端学习节点潜在矢量表达方式的方法,成功地应用了链接预测任务。然而,我们对这类方法的内部机制的理解相当有限,直到最近,我们才目睹了与成熟矩阵因子化理论建立非常令人信服的联系。在这项工作中,我们为理解跳格动力神经图嵌入算法和通过SVD矩阵因子化之间的相互作用作出了重要贡献。特别是,我们表明,图形嵌入过程的连接准确性在很大程度上取决于原始图形共振动矩阵的变异,有时导致连接预测任务的精确性能的惊人提高。我们改进了学习低级因子嵌入的方法,将不可能得到的节点对子嵌入的信息纳入与复杂神经图嵌入模型所实现的状态-艺术链接性预测性能产生结果。