高层次沙质块模型旁观群集的一致性 (Consistency of Spectral Clustering on Hierarchical Stochastic Block Models)

We study the hierarchy of communities in real-world networks under a generic stochastic block model, in which the connection probabilities are structured in a binary tree. Under such model, a standard recursive bi-partitioning algorithm is dividing the network into two communities based on the Fiedler vector of the unnormalized graph Laplacian and repeating the split until a stopping rule indicates no further community structures. We prove the strong consistency of this method under a wide range of model parameters, which include sparse networks with node degrees as small as $O(\log n)$. In addition, unlike most of existing work, our theory covers multiscale networks where the connection probabilities may differ by orders of magnitude, which comprise an important class of models that are practically relevant but technically challenging to deal with. Finally we demonstrate the performance of our algorithm on synthetic data and real-world examples.

翻译：我们在一个通用的随机区块模型下研究现实世界网络中的社区等级,其中连接概率在一棵二树上结构。在这种模型下,标准的递归双分算法将网络分成两个社区,以非正常的图解的Fiedler矢量为基础,重复这一划分,直到停止规则表明没有进一步的社区结构。我们证明这种方法在一系列广泛的模型参数下非常一致,其中包括小于O(\log n)美元的节点的稀少网络。此外,与大多数现有工作不同,我们的理论涵盖多尺度网络,其中连接概率可能因数量大小而不同,这构成了一个重要的模型类别,实际上具有相关性,但在技术上具有挑战性。最后,我们展示了我们在合成数据和现实世界实例方面的算法表现。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日