Mixed membership problem for undirected network has been well studied in network analysis recent years. However, the more general case of mixed membership for directed network in which nodes can belong to multiple communities remains a challenge. Here, we propose an interpretable and identifiable model: directed mixed membership stochastic blockmodel (DiMMSB) for directed mixed membership networks. DiMMSB allows that row nodes and column nodes of the adjacency matrix can be different and these nodes may have distinct community structure in a directed network. We also develop an efficient spectral algorithm called DiSP designed based on simplex structures inherent in the left and right singular vectors of the population adjacency matrix to estimate the mixed memberships for both row nodes and column nodes in a directed network. We show that DiSP is asymptotically consistent under mild conditions by providing error bounds for the inferred membership vectors of each row node and each column node using delicate spectral analysis. Numerical results on computer-generated directed mixed membership networks support our theoretical findings and show that our DiSP outperforms its competitor in both error rates and run-time. Applications of DiSP to real-world directed networks demonstrate the advantages of DiSP in studying the asymmetric structure of directed networks.
翻译:近些年来,在网络分析中,对非定向网络的混合成员问题进行了深入的研究,然而,对定向网络的混合成员问题进行了更为广泛的研究,其中节点可以归属多个社区。这里,我们提出了一个可解释和可识别的模式:定向混合成员制组合模式(DIMMSB),用于定向混合成员制网络。DIMMSB允许对相邻矩阵的行节点和列节点进行不同的研究,而这些节点在定向网络中可能有不同的社区结构。我们还开发了一个高效的光谱算法,称为DISP,其设计基于人口左侧和右侧单向矢量所固有的简单结构。我们提出了一种可解释和可识别的模式:即定向混合成员制混合成员制混合成员制模式(DIMMSB)为定向混合成员制模式(DIMMSB)。我们表明,DiSP在温和条件下,通过对每个行节点的推定成员制矢量和每列节点使用微妙的光谱分析,在轻度的频谱分析,从而得出了不同的社区结构。计算机生成的定向混合成员网络的数值结果支持我们的理论发现,并表明,DISSP在实时网络中都显示,在正态网络的优势和运行中均态网络结构上。