DP2-Pub:有差异的私自高差异数据出版物,带有变化式后随机化 (DP2-Pub: Differentially Private High-Dimensional Data Publication with Invariant Post Randomization) - 专知论文

会员服务 ·

0

簇 · 不变 · Extensibility · Analysis · INFORMS ·

2022 年 8 月 24 日

DP2-Pub: Differentially Private High-Dimensional Data Publication with Invariant Post Randomization

翻译：DP2-Pub:有差异的私自高差异数据出版物,带有变化式后随机化

Honglu Jiang,Haotian Yu,Xiuzhen Cheng,Jian Pei,Robert Pless,Jiguo Yu

A large amount of high-dimensional and heterogeneous data appear in practical applications, which are often published to third parties for data analysis, recommendations, targeted advertising, and reliable predictions. However, publishing these data may disclose personal sensitive information, resulting in an increasing concern on privacy violations. Privacy-preserving data publishing has received considerable attention in recent years. Unfortunately, the differentially private publication of high dimensional data remains a challenging problem. In this paper, we propose a differentially private high-dimensional data publication mechanism (DP2-Pub) that runs in two phases: a Markov-blanket-based attribute clustering phase and an invariant post randomization (PRAM) phase. Specifically, splitting attributes into several low-dimensional clusters with high intra-cluster cohesion and low inter-cluster coupling helps obtain a reasonable allocation of privacy budget, while a double-perturbation mechanism satisfying local differential privacy facilitates an invariant PRAM to ensure no loss of statistical information and thus significantly preserves data utility. We also extend our DP2-Pub mechanism to the scenario with a semi-honest server which satisfies local differential privacy. We conduct extensive experiments on four real-world datasets and the experimental results demonstrate that our mechanism can significantly improve the data utility of the published data while satisfying differential privacy.

翻译：大量高维和多元数据出现在实际应用中,这些应用往往向第三方公布数据分析、建议、有针对性的广告和可靠的预测;然而,公布这些数据可能披露个人敏感信息,导致对侵犯隐私行为日益关注;近年来,隐私保护数据出版受到相当重视;不幸的是,高维数据的不同私下出版仍然是一个具有挑战性的问题;在本文件中,我们提议建立一个有区别的私人高维数据出版机制(DP2-Pubb),分两个阶段运行:以Markov为基点的属性聚合阶段和无变式后随机化(PRAM)阶段。具体地说,将属性分割成几个低维群,集群内凝聚程度高和集群间混合程度低,有助于获得对隐私预算的合理分配,而满足本地差异隐私的双重扰动性机制则便利了无变式的PRAM,以确保统计信息不丢失,从而大大保护数据效用。我们还将我们的DP2-Pub机制扩展为设想情景,配有满足当地差异隐私的半声波服务器。我们广泛试验了四个真实世界数据机制,同时进行广泛的实用性数据交换。

0

相关内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

论文周报 | 推荐系统领域最新研究进展

论文周报 | 推荐系统领域最新研究进展

机器学习与推荐算法

2+阅读 · 2022年4月11日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

bantam对Dpp信号通路的调节影响神经胶质细胞及前体干细胞的增殖

国家自然科学基金

0+阅读 · 2015年12月31日

广义单调（增生）算子的零点逼近与分裂可行问题的正则化研究

国家自然科学基金

0+阅读 · 2014年12月31日

一类微分半变分不等式问题研究

国家自然科学基金

0+阅读 · 2014年12月31日

脊髓损伤后Fractalkine -小胶质细胞-星型胶质细胞回路介导胶质瘢痕形成的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

Schrodinger-Poisson方程的若干问题研究

国家自然科学基金

1+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

多复变函数空间上的算子理论

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Toll 样受体介导的巨噬细胞对prion清除的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

CANIFE: Crafting Canaries for Empirical Privacy Measurement in Federated Learning

Arxiv

0+阅读 · 2022年10月6日

Differentially Private Speaker Anonymization

Arxiv

0+阅读 · 2022年10月6日

On the Statistical Complexity of Estimation and Testing under Privacy Constraints

Arxiv

0+阅读 · 2022年10月5日

PrivTrace: Differentially Private Trajectory Synthesis by Adaptive Markov Model

Arxiv

0+阅读 · 2022年10月5日

Composition of Differential Privacy & Privacy Amplification by Subsampling

Arxiv

0+阅读 · 2022年10月4日

Robust self-healing prediction model for high dimensional data

Arxiv

0+阅读 · 2022年10月4日

Individualized PATE: Differentially Private Machine Learning with Individual Privacy Guarantees

Arxiv

0+阅读 · 2022年10月3日

Inference on High-dimensional Single-index Models with Streaming Data

Arxiv

0+阅读 · 2022年10月3日

Frequency Estimation of Evolving Data Under Local Differential Privacy

Arxiv

0+阅读 · 2022年10月1日

Differentially Private Optimization on Large Model at Small Cost

Arxiv

0+阅读 · 2022年9月30日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

论文周报 | 推荐系统领域最新研究进展

论文周报 | 推荐系统领域最新研究进展

机器学习与推荐算法

2+阅读 · 2022年4月11日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Call for Nominations: 2022 Multimedia Prize Paper Award

Call for Nominations: 2022 Multimedia Prize Paper Award

CCF多媒体专委会

0+阅读 · 2022年2月12日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

LibRec 精选：推荐系统的论文与源码

LibRec 精选：推荐系统的论文与源码

LibRec智能推荐

14+阅读 · 2018年11月29日

相关论文

CANIFE: Crafting Canaries for Empirical Privacy Measurement in Federated Learning

Arxiv

0+阅读 · 2022年10月6日

Differentially Private Speaker Anonymization

Arxiv

0+阅读 · 2022年10月6日

On the Statistical Complexity of Estimation and Testing under Privacy Constraints

Arxiv

0+阅读 · 2022年10月5日

PrivTrace: Differentially Private Trajectory Synthesis by Adaptive Markov Model

Arxiv

0+阅读 · 2022年10月5日

Composition of Differential Privacy & Privacy Amplification by Subsampling

Arxiv

0+阅读 · 2022年10月4日

Robust self-healing prediction model for high dimensional data

Arxiv

0+阅读 · 2022年10月4日

Individualized PATE: Differentially Private Machine Learning with Individual Privacy Guarantees

Arxiv

0+阅读 · 2022年10月3日

Inference on High-dimensional Single-index Models with Streaming Data

Arxiv

0+阅读 · 2022年10月3日

Frequency Estimation of Evolving Data Under Local Differential Privacy

Arxiv

0+阅读 · 2022年10月1日

Differentially Private Optimization on Large Model at Small Cost

Arxiv

0+阅读 · 2022年9月30日

相关基金

bantam对Dpp信号通路的调节影响神经胶质细胞及前体干细胞的增殖

国家自然科学基金

0+阅读 · 2015年12月31日

广义单调（增生）算子的零点逼近与分裂可行问题的正则化研究

国家自然科学基金

0+阅读 · 2014年12月31日

一类微分半变分不等式问题研究

国家自然科学基金

0+阅读 · 2014年12月31日

脊髓损伤后Fractalkine -小胶质细胞-星型胶质细胞回路介导胶质瘢痕形成的机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

具有临界指数的Schrodinger-Poisson系统的解

国家自然科学基金

0+阅读 · 2013年12月31日

Schrodinger-Poisson方程的若干问题研究

国家自然科学基金

1+阅读 · 2012年12月31日

非凸Hamilton系统的Aubry-Mather理论

国家自然科学基金

0+阅读 · 2012年12月31日

多复变函数空间上的算子理论

国家自然科学基金

0+阅读 · 2012年12月31日

Cystatin B缺失与Prion疾病自噬作用机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Toll 样受体介导的巨噬细胞对prion清除的分子机制

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员