Community detection is one of the most important and interesting issues in social network analysis. In recent years, simultaneous considering of nodes' attributes and topological structures of social networks in the process of community detection has attracted the attentions of many scholars, and this consideration has been recently used in some community detection methods to increase their efficiencies and to enhance their performances in finding meaningful and relevant communities. But the problem is that most of these methods tend to find non-overlapping communities, while many real-world networks include communities that often overlap to some extent. In order to solve this problem, an evolutionary algorithm called MOBBO-OCD, which is based on multi-objective biogeography-based optimization (BBO), is proposed in this paper to automatically find overlapping communities in a social network with node attributes with synchronously considering the density of connections and the similarity of nodes' attributes in the network. In MOBBO-OCD, an extended locus-based adjacency representation called OLAR is introduced to encode and decode overlapping communities. Based on OLAR, a rank-based migration operator along with a novel two-phase mutation strategy and a new double-point crossover are used in the evolution process of MOBBO-OCD to effectively lead the population into the evolution path. In order to assess the performance of MOBBO-OCD, a new metric called alpha_SAEM is proposed in this paper, which is able to evaluate the goodness of both overlapping and non-overlapping partitions with considering the two aspects of node attributes and linkage structure. Quantitative evaluations reveal that MOBBO-OCD achieves favorable results which are quite superior to the results of 15 relevant community detection algorithms in the literature.
翻译:社区探测是社会网络分析中最重要和最有趣的问题之一。近年来,在社区探测过程中同时考虑社会网络的节点属性和地形结构,引起了许多学者的注意,最近一些社区探测方法也采用了这种考虑,以提高其效率,并提高其在寻找有意义和相关社区方面的绩效。但问题是,大多数这些方法往往发现非重叠社区,而许多现实世界网络则包括往往在某种程度上重叠的社区。为了解决这一问题,本文件提议采用一个名为MOBBBO-OCD的演进算法,该算法基于多目标生物地理上的重叠优化(BBBOO),在社会网络中自动发现重叠社区,考虑到网络连接的密度和节点属性的相似性,从而同步地发现社区的效率。在MOBOO-OC文献中,一个名为OLARA的扩大的基于地表的相近似近似性代表度代表法被引入编码和解码重叠社区。基于OLAR的双级移徙操作者,连同一个新的两阶段的超级变异性OOC 战略以及一个新的双点业绩评估,在MOB中,一个新的双点评估了该级变异性结构。