Local graph clustering is an important algorithmic technique for analysing massive graphs, and has been widely applied in many research fields of data science. While the objective of most (local) graph clustering algorithms is to find a vertex set of low conductance, there has been a sequence of recent studies that highlight the importance of the inter-connection between clusters when analysing real-world datasets. Following this line of research, in this work we study local algorithms for finding a pair of vertex sets defined with respect to their inter-connection and their relationship with the rest of the graph. The key to our analysis is a new reduction technique that relates the structure of multiple sets to a single vertex set in the reduced graph. Among many potential applications, we show that our algorithms successfully recover densely connected clusters in the Interstate Disputes Dataset and the US Migration Dataset.
翻译:本地图集是分析大型图集的重要算法技术,并广泛应用于数据科学的许多研究领域。虽然大多数(本地)图集算法的目标是找到一组低导力的顶点,但最近的一系列研究显示,在分析真实世界数据集时,各组之间相互联系的重要性。根据这一研究方针,我们研究当地算法,以寻找一对因彼此连接及其与图中其余部分的关系而确定的顶点。我们分析的关键是一种新的缩小技术,将多组结构与减少的图集中的单一顶点联系起来。在很多潜在应用中,我们显示我们的算法成功地恢复了国家间争端数据集和美国迁移数据集中的密集连接组。