We consider the problem of generating private synthetic versions of real-world graphs containing private information while maintaining the utility of generated graphs. Differential privacy is a gold standard for data privacy, and the introduction of the differentially private stochastic gradient descent (DP-SGD) algorithm has facilitated the training of private neural models in a number of domains. Recent advances in graph generation via deep generative networks have produced several high performing models. We evaluate and compare state-of-the-art models including adjacency matrix based models and edge based models, and show a practical implementation that favours the edge-list approach utilizing the Gaussian noise mechanism when evaluated on commonly used graph datasets. Based on our findings, we propose a generative model that can reproduce the properties of real-world networks while maintaining edge-differential privacy. The proposed model is based on a stochastic neural network that generates discrete edge-list samples and is trained using the Wasserstein GAN objective with the DP-SGD optimizer. Being the first approach to combine these beneficial properties, our model contributes to further research on graph data privacy.
翻译:我们考虑了在保持所生成的图表的实用性的同时生成私人合成版本包含私人信息的真实世界图的问题。不同的隐私是数据隐私的金质标准,采用差别私人随机梯度下降算法(DP-SGD)有助于在若干领域对私人神经模型进行培训。最近通过深层基因化网络生成的图形的最新进展产生了几种高性能模型。我们评估和比较了最新的先进模型,包括基于对等矩阵的模型和基于边缘的模型,并展示了一种实际的实施,有利于在用常用的图表数据集进行评估时使用高斯语噪音机制的边缘列表方法。根据我们的调查结果,我们提出了一个可复制现实世界网络特性的基因模型,同时保持边缘偏差隐私。拟议模型基于产生离子边缘列表样本的随机神经网络,并且使用瓦列斯特因 GAN 目标与DP-SGD 优化器进行了培训。作为这些有益特性的第一个结合方法,我们的模型有助于进一步研究图表数据隐私。