This paper proposes a logistic undirected network formation model which allows for assortative matching on observed individual characteristics and the presence of edge-wise fixed effects. We model the coefficients of observed characteristics to have a latent community structure and the edge-wise fixed effects to be of low rank. We propose a multi-step estimation procedure involving nuclear norm regularization, sample splitting, iterative logistic regression and spectral clustering to detect the latent communities. We show that the latent communities can be exactly recovered when the expected degree of the network is of order log n or higher, where n is the number of nodes in the network. The finite sample performance of the new estimation and inference methods is illustrated through both simulated and real datasets.
翻译:本文提出一个后勤非定向网络形成模型,允许对观察到的个别特征和边缘固定效应的存在进行各种匹配。我们将观察到的特征系数建模,以具有潜在的群落结构,边缘固定效应建模为低级。我们提出一个多步骤估计程序,涉及核规范规范规范化、样本分离、迭接后勤回归和光谱集,以探测潜伏群落。我们表明,当网络的预期水平为顺序日志n或更高时,潜伏群落可以完全恢复,n是网络中的节点数目,新估计和推断方法的有限样本性能通过模拟和真实数据集加以说明。