Relational events are a type of social interactions, that sometimes are referred to as dynamic networks. Its dynamics typically depends on emerging patterns, so-called endogenous variables, or external forces, referred to as exogenous variables. Comprehensive information on the actors in the network, especially for huge networks, is rare, however. A latent space approach in network analysis has been a popular way to account for unmeasured covariates that are driving network configurations. Bayesian and EM-type algorithms have been proposed for inferring the latent space, but both the sheer size many social network applications as well as the dynamic nature of the process, and therefore the latent space, make computations prohibitively expensive. In this work we propose a likelihood-based algorithm that can deal with huge relational event networks. We propose a hierarchical strategy for inferring network community dynamics embedded into an interpretable latent space. Node dynamics are described by smooth spline processes. To make the framework feasible for large networks we borrow from machine learning optimization methodology. Model-based clustering is carried out via a convex clustering penalization, encouraging shared trajectories for ease of interpretation. We propose a model-based approach for separating macro-microstructures and perform a hierarchical analysis within successive hierarchies. The method can fit millions of nodes on a public Colab GPU in a few minutes. The code and a tutorial are available in a Github repository.
翻译:关系事件是一种社交互动,有时被称为动态网络。其动态通常取决于出现的模式,称为内生变量,或外部力量,称为外生变量。然而,对于大型网络来说,有关网络中参与者的全面信息非常罕见。网络分析中的潜在空间方法已经成为考虑驱动网络配置的未测量协变量的流行方法。已经提出了贝叶斯和EM类型的算法来推断潜在空间,但是由于过多的社交网络应用程序的大小以及过程和因此潜在空间的动态性,使得计算变得极其昂贵。在这项工作中,我们提出了一种可以处理巨大关系事件网络的基于可能性的算法。我们提出了一种层次结构策略,用于推断嵌入在可解释的潜在空间中的网络社区动态。节点动态由平滑样条过程描述。为使框架适用于大型网络,我们借鉴了机器学习优化方法学中所使用的方法。模型的聚类是通过凸聚类惩罚进行的,鼓励共享轨迹以便于解释。我们提出了一种分离宏观-微观结构的基于模型的方法,并在连续层次结构内进行层次分析。该方法可以在公共的Colab GPU上在几分钟内匹配数百万个节点。代码和教程在Github存储库中提供。