Latent position models are widely used for the analysis of networks in a variety of research fields. In fact, these models possess a number of desirable theoretical properties, and are particularly easy to interpret. However, statistical methodologies to fit these models generally incur a computational cost which grows with the square of the number of nodes in the graph. This makes the analysis of large social networks impractical. In this paper, we propose a new method characterised by a much reduced computational complexity, which can be used to fit latent position models on networks of several tens of thousands nodes. Our approach relies on an approximation of the likelihood function, where the amount of noise introduced by the approximation can be arbitrarily reduced at the expense of computational efficiency. We establish several theoretical results that show how the likelihood error propagates to the invariant distribution of the Markov chain Monte Carlo sampler. In particular, we demonstrate that one can achieve a substantial reduction in computing time and still obtain a good estimate of the latent structure. Finally, we propose applications of our method to simulated networks and to a large coauthorships network, highlighting the usefulness of our approach.
翻译:长期定位模型被广泛用于分析各种研究领域的网络。事实上,这些模型具有一些可取的理论属性,特别容易解释。然而,适应这些模型的统计方法通常会产生计算成本,随着图中节点的平方数增长。这使得大型社交网络的分析不切实际。在本文中,我们提出了一个新的方法,该方法的特点是计算复杂性大大降低,可以用来在数万个节点的网络上配置潜伏定位模型。我们的方法依赖于对可能性功能的近似,即近似所引入的噪音数量可以任意减少而牺牲计算效率。我们建立了若干理论结果,表明这些可能的错误是如何扩散到马可夫链蒙特卡洛取样器的无常分布的。特别是,我们证明,一个人可以大量减少计算时间,并且仍然能够很好地估计潜伏的结构。最后,我们建议应用我们模拟网络和大型合作者网络的方法,突出我们的方法的效用。</s>