Higher-order structures of networks, namely, small subgraphs of networks (also called network motifs), are widely known to be crucial and essential to the organization of networks. There has been a few work studying the community detection problem\textendash a fundamental problem in network analysis, at the level of motifs. In particular, higher-order spectral clustering has been developed, where the notion of \emph{motif adjacency matrix} is introduced as the input of the algorithm. However, it remains largely unknown that how higher-order spectral clustering works and when it performs better than its edge-based counterpart. To elucidate these problems, we investigate higher-order spectral clustering from a statistical perspective. In particular, we theoretically study the clustering performance of higher-order spectral clustering under a {\it weighted stochastic block model} and compare the resulting bounds with the corresponding results of edge-based spectral clustering. It turns out that when the network is dense with weak signal of weights, higher-order spectral clustering can really lead to the performance gain in clustering. We also use simulations and real data experiments to support the findings.
翻译:网络的较高层次结构,即网络的小型子集成(也称为网络主机),广为人知,对网络的组织至关重要,对网络的组织至关重要。在对社区检测问题/文字学的研究中,在网络分析中,在motifs层次上,出现了一些根本性的问题。特别是,开发了较高层次的光谱集成,引入了\emph{motif 相邻矩阵的概念作为算法的输入。但是,人们仍然基本上不知道,高层次的光谱集成如何运作,以及当它比边基组群表现更好时,我们从统计角度来调查更高层次的光谱集成问题。特别是,我们理论上研究在推式加权光谱区块模型下较高层次的光谱集集成的集成性能,并将由此产生的界限与边基光谱集成的相应结果进行比较。事实证明,当网络密度较弱时,高层次的光谱集集集成能够真正导致集成的绩效增益。我们还利用模拟和真实的数据实验来支持调查结果。