A network may have weak signals and severe degree heterogeneity, and may be very sparse in one occurrence but very dense in another. SCORE (Jin, 2015) is a recent approach to network community detection. It accommodates severe degree heterogeneity and is adaptive to different levels of sparsity, but its performance for networks with weak signals is unclear. In this paper, we show that in a broad class of network settings where we allow for weak signals, severe degree heterogeneity, and a wide range of network sparsity, SCORE achieves prefect clustering and has the so-called "exponential rate" in Hamming clustering errors. The proof uses the most recent advancement on entry-wise bounds for the leading eigenvectors of the network adjacency matrix. The theoretical analysis assures us that SCORE continues to work well in the weak signal settings, but it does not rule out the possibility that SCORE may be further improved to have better performance in real applications, especially for networks with weak signals. As a second contribution of the paper, we propose SCORE+ as an improved version of SCORE. We investigate SCORE+ with 8 network data sets and found that it outperforms several representative approaches. In particular, for the 6 data sets with relatively strong signals, SCORE+ has similar performance as that of SCORE, but for the 2 data sets (Simmons, Caltech) with possibly weak signals, SCORE+ has much lower error rates. SCORE+ proposes several changes to SCORE. We carefully explain the rationale underlying each of these changes, using a mixture of theoretical and numerical study.
翻译:一个网络可能信号薄弱,而且存在严重程度的异质性,并且可能在一个情况中非常稀少,但在另一个情况中可能非常少。 SCORE(Jin, 2015)是网络群落检测的最新方法。它包含高度异质性,适应不同程度的广度,但对于信号薄弱的网络来说,其性能并不清楚。在本文中,我们表明,在一个广泛的网络设置类别中,我们允许薄弱信号、严重程度异质性,以及广泛的网络频度,SCORE能够实现省一级集群,并在Hamming群集错误中拥有所谓的“超度率 ” 。 SCORE(Jin, 2015) 是网络群集群群群群中最近的一种方法。SCORE(Jin, Jin, Jin, 2015 2015) 是网络中最高级的入门边框。但理论分析向我们保证,SCORE(S)在信号薄弱的情况下继续运行良好,但不排除SCORE(S)在实际应用中,特别是网络中可能改进性能改进的性能, 。作为第二贡献,我们建议SCORE+ 以更强的S) 数据格式的S 的S 以一些S 的S 数字的S 数字的S&S 的系统,我们发现S 的S 和S 的S 和S 的S 的S 的S 数字的S 的S 的S 的系统为相对性能。