Finding dense subgraphs of a large graph is a standard problem in graph mining that has been studied extensively both for its theoretical richness and its many practical applications. In this paper we introduce a new family of dense subgraph objectives, parameterized by a single parameter $p$, based on computing generalized means of degree sequences of a subgraph. Our objective captures both the standard densest subgraph problem and the maximum $k$-core as special cases, and provides a way to interpolate between and extrapolate beyond these two objectives when searching for other notions of dense subgraphs. In terms of algorithmic contributions, we first show that our objective can be minimized in polynomial time for all $p \geq 1$ using repeated submodular minimization. A major contribution of our work is analyzing the performance of different types of peeling algorithms for dense subgraphs both in theory and practice. We prove that the standard peeling algorithm can perform arbitrarily poorly on our generalized objective, but we then design a more sophisticated peeling method which for $p \geq 1$ has an approximation guarantee that is always at least $1/2$ and converges to 1 as $p \rightarrow \infty$. In practice, we show that this algorithm obtains extremely good approximations to the optimal solution, scales to large graphs, and highlights a range of different meaningful notions of density on graphs coming from numerous domains. Furthermore, it is typically able to approximate the densest subgraph problem better than the standard peeling algorithm, by better accounting for how the removal of one node affects other nodes in its neighborhood.
翻译:查找大图中密度的子图是一个标准的问题, 在图形开采中, 已经对其理论丰富性及其许多实际应用进行了广泛的研究。 在本文中, 我们引入了一组新的密密的子图目标, 以单一参数 $p$为参数, 以计算一个子图的通用度序列方式为基础。 我们的目标捕获了标准密密的子图问题, 以及最高 $k- 核心作为特例。 我们证明标准剥离算法可以对我们普遍目标进行任意的差错, 但是在寻找其它密集子图概念时, 我们设计了一个更精密的剥离方法。 在算法贡献方面, 我们首先显示我们的目标可以在多数值的多数值时间里最小的多数值里里里里, 我们工作的主要贡献是分析不同种类的剥离算算法的性表现, 其精确性格上的精确度比其它的直径直径直值要低得多, 其精确度的直径直径直径直径直到最深的直径直径直径直到最深的直径直径直径直径直径直到最接近的直径直径直径直径的直径直径, 。