Community detection methods attempt to divide a network into groups of nodes that share similar properties, thus revealing its large-scale structure. A major challenge when employing such methods is that they are often degenerate, typically yielding a complex landscape of competing answers. As an attempt to extract understanding from a population of alternative solutions, many methods exist to establish a consensus among them in the form of a single partition "point estimate" that summarizes the whole distribution. Here we show that it is in general not possible to obtain a consistent answer from such point estimates when the underlying distribution is too heterogeneous. As an alternative, we provide a comprehensive set of methods designed to characterize and summarize complex populations of partitions in a manner that captures not only the existing consensus, but also the dissensus between elements of the population. Our approach is able to model mixed populations of partitions where multiple consensuses can coexist, representing different competing hypotheses for the network structure. We also show how our methods can be used to compare pairs of partitions, how they can be generalized to hierarchical divisions, and be used to perform statistical model selection between competing hypotheses.
翻译:社区探测方法试图将网络分成具有类似特性的节点组,从而揭示其大规模结构。在采用这种方法时,一个重大挑战是,它们往往退化,通常产生复杂的相互竞争的答案。为了试图从众多的替代解决办法中获取理解,存在着许多方法,以一个单一的分隔“点估计”的形式在它们之间建立共识,该“点估计”可以概括整个分布情况。这里我们表明,在基本分布过于分散的情况下,一般不可能从这种点估计中获得一致的答案。作为一种替代办法,我们提供了一套综合的方法,用以描述和归纳复杂的分区人口,不仅捕捉到现有的共识,而且捕捉到人口各组成部分之间的偏移。我们的方法可以建模混合的分区人口,在多重共识可以共存的情况下,代表网络结构中不同的相互竞争的假设。我们还表明,如何使用我们的方法来比较对分区的对等,如何将它们普遍化为等级划分,并用来在相互竞争的假设之间进行统计模型的选择。