Community detection for unweighted networks has been widely studied in network analysis, but the case of weighted networks remains a challenge. This paper proposes a general Distribution-Free Model (DFM) for weighted networks in which nodes are partitioned into different communities. DFM can be seen as a generalization of the famous stochastic blockmodels from unweighted networks to weighted networks. DFM does not require prior knowledge of a specific distribution for elements of the adjacency matrix but only the expected value. In particular, signed networks with latent community structures can be modeled by DFM. We build a theoretical guarantee to show that a simple spectral clustering algorithm stably yields consistent community detection under DFM. We also propose a four-step data generation process to generate adjacency matrices with missing edges by combining DFM, noise matrix, and a model for unweighted networks. Using experiments with simulated and real datasets, we show that some benchmark algorithms can successfully recover community membership for weighted networks generated by the proposed data generation process.
翻译:在网络分析中,对未加权网络的社区检测进行了广泛的研究,但加权网络的情况仍是一个挑战。本文件提议为加权网络制定一个通用的无分配模式,将节点分割到不同的社区。DFM可被视为从未加权网络到加权网络的著名随机区块模型的概括化。DFM并不要求事先了解对相邻矩阵各元素的具体分布情况,而只要求预期值。特别是,DFM可以模拟具有潜在社区结构的已签字网络。我们建立了一个理论保证,以显示一个简单的光谱集群算法在DFM下稳定地得出一致的社区检测结果。我们还提议了一个四步数据生成过程,通过将DFM、噪音矩阵和未加权网络模型相结合,产生相匹配边缘的相近矩阵。我们利用模拟和真实数据集的实验,表明一些基准算法可以成功地恢复拟议数据生成过程生成的加权网络的社区成员资格。