As a classical problem in the field of complex networks, link prediction has attracted much attention from researchers, which is of great significance to help us understand the evolution and dynamic development mechanisms of networks. Although various network type-specific algorithms have been proposed to tackle the link prediction problem, most of them suppose that the network structure is dominated by the Triadic Closure Principle. We still lack an adaptive and comprehensive understanding of network formation patterns for predicting potential links. In addition, it is valuable to investigate how network local information can be better utilized. To this end, we proposed a novel method named Link prediction using Multiple Order Local Information (MOLI) that exploits the local information from the neighbors of different distances, with parameters that can be a prior-driven based on prior knowledge, or data-driven by solving an optimization problem on observed networks. MOLI defined a local network diffusion process via random walks on the graph, resulting in better use of network information. We show that MOLI outperforms the other 11 widely used link prediction algorithms on 11 different types of simulated and real-world networks. We also conclude that there are different patterns of local information utilization for different networks, including social networks, communication networks, biological networks, etc. In particular, the classical common neighbor-based algorithm is not as adaptable to all social networks as it is perceived to be; instead, some of the social networks obey the Quadrilateral Closure Principle which preferentially connects paths of length three.
翻译:作为复杂网络领域的一个古老问题,链接预测吸引了研究人员的极大关注,这非常重要,有助于我们理解网络的演变和动态发展机制。虽然提出了各种网络类型特定算法,以解决链接预测问题,但大多数假设网络结构由三重封闭原则主导。我们仍然缺乏对网络形成模式的适应性和全面了解,无法预测潜在链接。此外,调查如何更好地利用网络本地信息是有价值的。为此,我们提议了一个名为链接预测的新方法,它利用了来自不同距离邻国的多秩序地方信息(多秩序本地信息),利用了不同距离的近邻的本地信息,并提出了以前可以根据先前知识驱动的参数,或通过解决所观察到的网络的优化问题驱动的数据参数。MOLI通过在图表上随机走动确定了本地网络的网络传播进程,从而更好地利用了网络信息。我们表明,MOLI在11种不同的模拟和实际世界网络上,超越了广泛使用的11种广泛使用的链接的预测算法。我们还得出结论,不同网络的当地信息利用模式不同,包括社交网络,而不是传统的社交网络,而传统的社交网络是传统的、特定的网络。