Given a graph, the densest subgraph problem asks for a set of vertices such that the average degree among these vertices is maximized. Densest subgraph has numerous applications in learning, e.g., community detection in social networks, link spam detection, correlation mining, bioinformatics, and so on. Although there are efficient algorithms that output either exact or approximate solutions to the densest subgraph problem, existing algorithms may violate the privacy of the individuals in the network, e.g., leaking the existence/non-existence of edges. In this paper, we study the densest subgraph problem in the framework of the differential privacy, and we derive the first upper and lower bounds for this problem. We show that there exists a linear-time $\epsilon$-differentially private algorithm that finds a $2$-approximation of the densest subgraph with an extra poly-logarithmic additive error. Our algorithm not only reports the approximate density of the densest subgraph, but also reports the vertices that form the dense subgraph. Our upper bound almost matches the famous $2$-approximation by Charikar both in performance and in approximation ratio, but we additionally achieve differential privacy. In comparison with Charikar's algorithm, our algorithm has an extra poly-logarithmic additive error. We partly justify the additive error with a new lower bound, showing that for any differentially private algorithm that provides a constant-factor approximation, a sub-logarithmic additive error is inherent. We also practically study our differentially private algorithm on real-world graphs, and we show that in practice the algorithm finds a solution which is very close to the optimal
翻译:在图形中, 最稠密的下层问题要求一组螺旋, 使这些脊椎的平均程度最大化。 在本文中, 我们研究差异隐私权框架中最稠密的子系统问题, 我们从社会网络的社区检测、 链接垃圾检测、 相关采矿、 生物信息学等许多应用。 虽然有高效的算法, 输出精确或近似地解决最稠密的子系统问题, 但现有的算法可能会侵犯网络中个人的隐私, 例如, 泄露这些边缘的存在/ 不存在。 在本文中, 我们研究了差异隐私权框架中最稠密的子系统问题, 我们从中得出了这个问题的第一个上下游界限。 我们显示的是线性时间 $\ epslon- 差异性私人算法, 发现最稠密的子系统有2美元乘以超多logic的添加错误。 我们的算法不仅报告了最稠密的子系统的深度密度, 我们还报告了构成密度亚值亚值亚值的垂直值, 并且我们从实际的精确性算算中找到了一个非常精确的精确性。 我们的上, 我们的直观的直观的直观和直观的直径比, 我们用Charkaral 显示了我们得到了一个更接近的比值, 我们的比值的 。