We consider an approach for community detection in time-varying networks. At its core, this approach maintains a small sketch graph to capture the essential community structure found in each snapshot of the full network. We demonstrate how the sketch can be used to explicitly identify six key community events which typically occur during network evolution: growth, shrinkage, merging, splitting, birth and death. Based on these detection techniques, we formulate a community detection algorithm which can process a network concurrently exhibiting all processes. One advantage afforded by the sketch-based algorithm is the efficient handling of large networks. Whereas detecting events in the full graph may be computationally expensive, the small size of the sketch allows changes to be quickly assessed. A second advantage occurs in networks containing clusters of disproportionate size. The sketch is constructed such that there is equal representation of each cluster, thus reducing the possibility that the small clusters are lost in the estimate. We present a new standardized benchmark based on the stochastic block model which models the addition and deletion of nodes, as well as the birth and death of communities. When coupled with existing benchmarks, this new benchmark provides a comprehensive suite of tests encompassing all six community events. We provide analysis and a set of numerical results demonstrating the advantages of our approach both in run time and in the handling of small clusters.
翻译:我们考虑在时间变化的网络中进行社区探测的方法。 在其核心方面, 这种方法维持一个小草图图, 以捕捉整个网络的每个快照中发现的基本社区结构。 我们展示了如何利用草图明确确定在网络演变期间通常发生的六大社区事件: 增长、缩小、合并、合并、分裂、出生和死亡。 我们根据这些探测技术, 开发了一种社区探测算法, 可以同时处理显示所有过程的网络。 基于草图的算法提供的一个优势是高效率地处理大型网络。 完整图表中的探测事件可能计算得非常昂贵,但草图的面积较小,可以迅速评估变化。 第二个优势出现在包含过大集群的网络中。 草图的构建使每个集群具有同等的代表性,从而减少了小集群在估计中丢失的可能性。 我们根据一个以图式块模型为基础的新的标准化基准,该模型模拟了节点的增删,以及社区的诞生和死亡。 当与现有的基准相结合时, 新的基准提供了涵盖所有六个社区事件的全面测试套套。 我们提供了一个优势分析和一组数字结果, 展示了我们处理方法的优势和一组数字结果。