Given a complete graph $G = (V, E)$ where each edge is labeled $+$ or $-$, the Correlation Clustering problem asks to partition $V$ into clusters to minimize the number of $+$edges between different clusters plus the number of $-$edges within the same cluster. Correlation Clustering has been used to model a large number of clustering problems in practice, making it one of the most widely studied clustering formulations. The approximability of Correlation Clustering has been actively investigated [BBC04, CGW05, ACN08], culminating in a $2.06$-approximation algorithm [CMSY15], based on rounding the standard LP relaxation. Since the integrality gap for this formulation is 2, it has remained a major open question to determine if the approximation factor of 2 can be reached, or even breached. In this paper, we answer this question affirmatively by showing that there exists a $(1.994 + \epsilon)$-approximation algorithm based on $O(1/\epsilon^2$) rounds of the Sherali-Adams hierarchy. In order to round a solution to the Sherali-Adams relaxation, we adapt the {\em correlated rounding} originally developed for CSPs [BRS11, GS11, RT12]. With this tool, we reach an approximation ratio of $2+\epsilon$ for Correlation Clustering. To breach this ratio, we go beyond the traditional triangle-based analysis by employing a global charging scheme that amortizes the total cost of the rounding across different triangles.
翻译:鉴于一个完整的GG=(V,E)美元,其中每个边缘的比值贴上美元或美元,关系分组问题要求将V美元分成一组,以尽量减少不同组群之间美元+美元之间的置值数,加上同一组群内美元-美元之间的置值。关系分组被用来模拟实际中大量组群问题,使其成为最广泛研究的组群配方之一。已经积极调查了关系分组的近似性[BBC04, CGW05, ACN08],在标准LP放松的基础上,最终将V美元分成组群,以尽量减少不同组群之间美元+美元的数目。由于这一公式的整体性差距是2,因此它仍然是一个重要的未决问题,以确定是否能够达到2的近似系数,甚至被打破。在本文件中,我们肯定地回答了这个问题,我们通过显示基于 $(1/\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\xxxxxxxxxxxxxxxxxxxxxx我们的系统系统(Slxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx)