Directional relational event data, such as email data, often contain unicast messages (i.e., messages of one sender towards one receiver) and multicast messages (i.e., messages of one sender towards multiple receivers). The Enron email data that is the focus in this paper consists of 31% multicast messages. Multicast messages contain important information about the roles of actors in the network, which is needed for better understanding social interaction dynamics. In this paper a multiplicative latent factor model is proposed to analyze such relational data. For a given message, all potential receiver actors are placed on a suitability scale, and the actors are included in the receiver set whose suitability score exceeds a threshold value. Unobserved heterogeneity in the social interaction behavior is captured using a multiplicative latent factor structure with latent variables for actors (which differ for actors as senders and receivers) and latent variables for individual messages. A Bayesian computational algorithm, which relies on Gibbs sampling, is proposed for model fitting. Model assessment is done using posterior predictive checks. Based on our analyses of the Enron email data, a mc-amen model with a 2 dimensional latent variable can accurately capture the empirical distribution of the cardinality of the receiver set and the composition of the receiver sets for commonly observed messages. Moreover the results show that actors have a comparable (but not identical) role as a sender and as a receiver in the network.
翻译:方向关系事件数据,例如电子邮件数据,往往包含单线信息(即一个发件人给一个接收者的信息)和多传信息(即一个发件人给一个接收者的信息)和多传信息(即一个发件人给多个接收者的信息)。Enron电子邮件数据是本文的重点,它包含31 % 多播信息。多播信息包含关于网络中行为者作用的重要信息,这是更好地了解社会互动动态所需要的。在本文中,建议一个多复制的潜在潜在要素模型来分析这种关系数据。对于给定信息,所有潜在接收者都按适当比例排列,而行为者则包括在接收人集中,其适应性得分超过临界值。在收集社会互动行为中未观察到到的异性异性潜在因素结构时,使用一个多复制性潜在因素结构来捕捉Enron电子邮件数据(作为发送者和接收人),以及个人电文的潜在变量变量。一个Bayesian计算算法,依靠GIbbs 取样,可以进行模型的预测检查。基于我们对Enron 电子邮件数据的分析,一个可观察到的可变式网络结构模型,一个模型显示一个可变性模型的模型的模型,一个可变式的模型的模型,用来显示普通服务器的模型的模型的模型,用来显示一个可变式的模型的模型的模型的模型的模型,用来显示。