The problem of predicting links in large networks is an important task in a variety of practical applications, including social sciences, biology and computer security. In this paper, statistical techniques for link prediction based on the popular random dot product graph model are carefully presented, analysed and extended to dynamic settings. Motivated by a practical application in cyber-security, this paper demonstrates that random dot product graphs not only represent a powerful tool for inferring differences between multiple networks, but are also efficient for prediction purposes and for understanding the temporal evolution of the network. The probabilities of links are obtained by fusing information at two stages: spectral methods provide estimates of latent positions for each node, and time series models are used to capture temporal dynamics. In this way, traditional link prediction methods, usually based on decompositions of the entire network adjacency matrix, are extended using temporal information. The methods presented in this article are applied to a number of simulated and real-world graphs, showing promising results.
翻译:在包括社会科学、生物学和计算机安全在内的各种实际应用中,预测大型网络联系的问题是一项重要任务。在本文中,根据流行随机点产品图表模型进行链接预测的统计技术经过仔细介绍、分析并推广到动态环境。由于网络安全的实用应用,本文表明随机点产品图不仅代表了推断多个网络之间差异的有力工具,而且对于预测目的和了解网络的时间演变也十分有效。链接的概率是通过在两个阶段使用信息获得的:光谱方法为每个节点提供潜在位置的估计数,使用时间序列模型来捕捉时间动态。这样,通常基于整个网络相邻矩阵分解的传统链接预测方法,利用时间信息加以扩展。本篇文章中介绍的方法适用于若干模拟和实际世界的图表,显示有希望的结果。