Mixed membership problem for undirected network has been well studied in network analysis recent years. However, the more general case of mixed membership for directed network remains a challenge. Here, we propose an interpretable and identifiable model: directed mixed membership stochastic blockmodel (DiMMSB for short) for directed mixed membership networks. DiMMSB allows that row nodes and column nodes of the adjacency matrix can be different and these nodes may have distinct community structure in a directed network. We also develop an efficient spectral algorithm called DiSP designed based on simplex structures inherent in the left and right singular vectors of the population adjacency matrix to estimate the mixed memberships for both row nodes and column nodes in a directed network. We show that DiSP is asymptotically consistent under mild conditions by providing error bounds for the inferred membership vectors of each row node and each column node using delicate spectral analysis. We demonstrate the advantages of DiSP with applications to simulated directed mixed membership network, the directed Political blogs network and the Papers Citation network.
翻译:近些年来,网络分析对非定向网络的混合成员问题进行了深入研究,然而,对定向网络的混合成员构成这一更为普遍的情况仍是一个挑战。在这里,我们提出了一个可解释和可识别的模式:针对定向混合成员网络的混合成员构成结构(DIMMSB短短),DIMMSB允许相邻矩阵的行节点和列节点不同,这些节点在定向网络中可能有不同的社区结构。我们还开发了一种高效的光谱算法,称为DIS,它基于人口左侧和右侧单一矢量中固有的简单x结构设计,用以估计定向网络中行节点和列节点的混合成员构成。我们表明,DISP在温和条件下,通过对每个行节点和每列节点的推断成员矢量提供错误的界限,使用微妙的光谱分析,对每个列节点的推断矢量进行误差。我们展示了DIS的优势,它应用模拟了定向混合成员网络、定向政治博客网络和文件引用网络。