The task of community detection, which aims to partition a network into clusters of nodes to summarize its large-scale structure, has spawned the development of many competing algorithms with varying objectives. Some community detection methods are inferential, explicitly deriving the clustering objective through a probabilistic generative model, while other methods are descriptive, dividing a network according to an objective motivated by a particular application, making it challenging to compare these methods on the same scale. Here we present a solution to this problem that associates any community detection objective, inferential or descriptive, with its corresponding implicit network generative model. This allows us to compute the description length of a network and its partition under arbitrary objectives, providing a principled measure to compare the performance of different algorithms without the need for "ground truth" labels. Our approach also gives access to instances of the community detection problem that are optimal to any given algorithm, and in this way reveals intrinsic biases in popular descriptive methods, explaining their tendency to overfit. Using our framework, we compare a number of community detection methods on artificial networks, and on a corpus of over 500 structurally diverse empirical networks. We find that more expressive community detection methods exhibit consistently superior compression performance on structured data instances, without having degraded performance on a minority of situations where more specialized algorithms perform optimally. Our results undermine the implications of the "no free lunch" theorem for community detection, both conceptually and in practice, since it is confined to unstructured data instances, unlike relevant community detection problems which are structured by requirement.
翻译:社区检测任务旨在将网络分成一组,将网络分成若干节点,以总结其大规模结构,由此产生了许多具有不同目标的竞争算法的发展。一些社区检测方法具有推论性,通过概率化基因模型明确得出组合目标,而其他方法则是描述性的,根据特定应用的驱动目标将网络分割成一个目标,因此在规模上比较这些方法具有挑战性。在这里,我们提出了一个解决问题的办法,将任何社区检测目标、推断性或描述性,与其相应的隐含网络基因化模型联系起来。这使我们能够根据武断的目标来计算网络及其分布的描述长度,提供一个原则性衡量标准,以比较不同算法的运行情况,而不需要“地面真相”标签。我们的方法还提供对社区检测问题进行描述的机会,根据特定应用的方法来比较这些方法的内在偏差,解释其过分的倾向。我们利用我们的框架,比较了一些社区检测方法,其结构化方法在结构化的网络和五百多个非结构化实验性网络上都有。我们发现,一种原则性的措施是,在不需“地面”的社区检测中,一个更清晰的特征性检验结果是持续地压压压低的,从任何特定的少数群体检测结果。</s>