Modern network datasets are often composed of multiple layers, either as different views, time-varying observations, or independent sample units, resulting in collections of networks over the same set of vertices but with potentially different connectivity patterns on each network. These data require models and methods that are flexible enough to capture local and global differences across the networks, while at the same time being parsimonious and tractable to yield computationally efficient and theoretically sound solutions that are capable of aggregating information across the networks. This paper considers the multilayer degree-corrected stochastic blockmodel, where a collection of networks share the same community structure, but degree-corrections and block connection probability matrices are permitted to be different. We establish the identifiability of this model and propose a spectral clustering algorithm for community detection in this setting. Our theoretical results demonstrate that the misclustering error rate of the algorithm improves exponentially with multiple network realizations, even in the presence of significant layer heterogeneity with respect to degree corrections, signal strength, and spectral properties of the block connection probability matrices. Simulation studies show that this approach improves on existing multilayer community detection methods in this challenging regime. Furthermore, in a case study of US airport data through January 2016 -- September 2021, we find that this methodology identifies meaningful community structure and trends in airport popularity influenced by pandemic impacts on travel.
翻译:现代网络数据集通常由多层组成,既有不同的观点、时间变化式的观测,也有独立的抽样单位,因此收集了同一一组脊椎的网络,但每个网络的连接模式可能不同。这些数据需要模型和方法,这些模型和方法足够灵活,足以捕捉各网络之间的地方和全球差异,而与此同时,这些模型和方法又具有相似性和可移植性,可产生计算高效和理论上健全的解决方案,能够汇集各网络的信息。本文考虑了多层级的经度校正的随机区块模型,收集的网络具有相同的社区结构,但允许有不同的程度校正和区块连接概率矩阵。我们建立了这一模型的可识别性,并提出了在这种环境下社区检测的光谱组合算法。我们的理论结果表明,算法错误率随着多个网络的实现而快速提高,即使存在大量层分解,在区系连接概率矩阵的度、信号强度和光谱性特性方面,也存在重大的分层差异。模拟研究显示,这一方法改进了现有多层社区在9月20号社区检测方法上对机场大流行病的影响,我们通过2016年1月20号机场的案例研究确定了这一具有挑战性的趋势。