更好、更少、更好:用外部线进行社区搜索 (Better Fewer but Better: Community Search with Outliers)

Given a set of vertices in a network, that we believe are of interest for the application under analysis, community search is the problem of producing a subgraph potentially explaining the relationships existing among the vertices of interest. In practice this means that the solution should add some vertices to the query ones, so to create a connected subgraph that exhibits some "cohesiveness" property. This problem has received increasing attention in recent years: while several cohesiveness functions have been studied, the bulk of the literature looks for a solution subgraphs containing all the query vertices. However, in many exploratory analyses we might only have a reasonable belief about the vertices of interest: if only one of them is not really related to the others, forcing the solution to include all of them might hide the existence of much more cohesive and meaningful subgraphs, that we could have found by allowing the solution to detect and drop the outlier vertex. In this paper we study the problem of community search with outliers, where we are allowed to drop up to $k$ query vertices, with $k$ being an input parameter. We consider three of the most used measures of cohesiveness: the minimum degree, the diameter of the subgraph and the maximum distance with a query vertex. By optimizing one and using one of the others as a constraint we obtain three optimization problems: we study their hardness and we propose different exact and approximation algorithms.

翻译：考虑到网络中的一系列顶点,我们认为,对于所分析的应用来说,这一系列顶点是值得注意的,因此,社区搜索是制作一个子集的问题,它有可能解释利益顶点之间存在的关系。在实践上,这意味着解决办法应在查询顶点上添加一些顶点,从而创建一个连接的顶点,以显示某种“共性”属性。近年来,这个问题日益受到越来越多的关注:虽然研究了一些凝聚力功能,但大部分文献都寻找包含所有查询顶点的解决方案子集。然而,在许多探索性分析中,我们可能只对利益顶点中的顶点有合理的信念:如果其中的一个顶点与其它顶点没有真正关联,那么就意味着解决办法应在查询顶点上添加一些顶点,从而将所有顶点都包含起来,这样我们就可以通过让解决办法探测和丢弃外端的顶点来找到。在这份文件中,我们研究社区搜索的问题,在那里我们可以将查询的顶点降到$$,我们只能对顶点的顶点有一个合理的底点,我们用一个最精确的底点,我们用一个最精确的底点来研究。我们用一个最精确的顶点的底点来研究。我们考虑三个的顶点的底点的底点的底点。我们用一个最精确的底点的底点的底点。我们考虑三个的底点。我们用一个最深点的底点的底点的底点的底点,用一个最精确度。