In many applications of network analysis, it is important to distinguish between observed and unobserved factors affecting network structure. To this end, we develop spectral estimators for both unobserved blocks and the effect of covariates in stochastic blockmodels. On the theoretical side, we establish asymptotic normality of our estimators for the subsequent purpose of performing inference. On the applied side, we show that computing our estimator is much faster than standard variational expectation--maximization algorithms and scales well for large networks. Monte Carlo experiments suggest that the estimator performs well under different data generating processes. Our application to Facebook data shows evidence of homophily in gender, role and campus-residence, while allowing us to discover unobserved communities. The results in this paper provide a foundation for spectral estimation of the effect of observed covariates as well as unobserved latent community structure on the probability of link formation in networks.
翻译:在网络分析的许多应用中,必须区分影响网络结构的观测和未观测因素。为此,我们为未观测区块和共异区块模型的影响开发光谱估计器。在理论方面,我们为随后进行推断的目的,确定了我们的估计器的无症状常态性。在应用方面,我们显示计算我们的估计器比大型网络标准变异预期-最大化算法和尺度要快得多。蒙特卡洛实验显示,估计器在不同数据生成程序下运行良好。我们在Facebook数据中的应用程序显示性别、角色和校园居住区等同性的证据,同时让我们能够发现未观测的社区。本文的结果为观测到的共变数以及未观测到的潜在社区结构对网络连接概率的光谱估计提供了基础。