The stochastic block model (SBM) is a random graph model with different group of vertices connecting differently. It is widely employed as a canonical model to study clustering and community detection, and provides a fertile ground to study the information-theoretic and computational tradeoffs that arise in combinatorial statistics and more generally data science. This monograph surveys the recent developments that establish the fundamental limits for community detection in the SBM, both with respect to information-theoretic and computational tradeoffs, and for various recovery requirements such as exact, partial and weak recovery. The main results discussed are the phase transitions for exact recovery at the Chernoff-Hellinger threshold, the phase transition for weak recovery at the Kesten-Stigum threshold, the optimal SNR-mutual information tradeoff for partial recovery, and the gap between information-theoretic and computational thresholds. The monograph gives a principled derivation of the main algorithms developed in the quest of achieving the limits, in particular two-round algorithms via graph-splitting, semi-definite programming, (linearized) belief propagation, classical/nonbacktracking spectral methods and graph powering. Extensions to other block models, such as geometric block models, and a few open problems are also discussed.
翻译:随机图表模型(SBM)是一个随机的图形模型,有不同组合的脊椎不同连接。它被广泛用作研究集群和社区检测的金字塔模型,为研究组合统计和更广泛的数据科学中出现的信息理论和计算平衡提供了肥沃的土壤。该专论调查了在信息理论和计算取舍以及各种回收要求(如精确、部分和薄弱的恢复)方面为在SBM中社区检测确定基本限制的最新发展情况。讨论的主要结果包括切尔诺夫-希腊人门槛精确恢复的阶段过渡、Kesten-Stugum临界点恢复缓慢的阶段过渡、部分恢复的最佳SNR-相互信息权衡,以及信息理论与计算阈值之间的差距。该专论为寻求实现这些限制而制定的主要算提供了原则性衍生,特别是通过图形分裂、半确定性程序、半确定性程序、(线性)定的两轮算算法的阶段过渡、Kesten-Stugum临界临界点临界点临界点临界点临界点的恢复阶段过渡阶段过渡、部分恢复的最佳SNR-Moverial-cal trackal tracksal laism laim sal laism slations ands and sal lavelations andsismlations and slation y squslusluslationslations y fals y sqslupluplationslations ands ands and sqslationslationslationslations ands laismusmusmusmusmus andsmus and sqsmus ands and sqsmus andslgy sqslationslations ands y sqslationslupslismsldsldsldsmsldsldsldsldslddddddddddddddds ands yds ands and sqs and sqs ands ands and sqslddsldslds ysldsldsldsldsldslds ex, ex, ex,