We study the problem of graph clustering under a broad class of objectives in which the quality of a cluster is defined based on the ratio between the number of edges in the cluster, and the total weight of vertices in the cluster. We show that our definition is closely related to popular clustering measures, namely normalized associations, which is a dual of the normalized cut objective, and normalized modularity. We give a linear time constant-approximate algorithm for our objective, which implies the first constant-factor approximation algorithms for normalized modularity and normalized associations.
翻译:我们研究在一大类目标下进行图表分组的问题,在这类目标中,根据组群边缘数与组群脊椎总重量之比来界定组群的质量。我们表明,我们的定义与大众集群措施密切相关,即:标准化协会,这是标准化削减目标的双重,以及标准化模块化。我们为我们的目标给出了直线时间常数近似算法,这意味着标准化模块化和标准化协会的首个常数要素近似算法。