利用公共数据进行实际私自查询释放 (Leveraging Public Data for Practical Private Query Release) - 专知论文

会员服务 ·

0

统计量 · ACS · state-of-the-art · Performance · 缩放 ·

2021 年 6 月 11 日

Leveraging Public Data for Practical Private Query Release

翻译：利用公共数据进行实际私自查询释放

Terrance Liu,Giuseppe Vietri,Thomas Steinke,Jonathan Ullman,Zhiwei Steven Wu

from arxiv, ICML 2021

In many statistical problems, incorporating priors can significantly improve performance. However, the use of prior knowledge in differentially private query release has remained underexplored, despite such priors commonly being available in the form of public datasets, such as previous US Census releases. With the goal of releasing statistics about a private dataset, we present PMW^Pub, which -- unlike existing baselines -- leverages public data drawn from a related distribution as prior information. We provide a theoretical analysis and an empirical evaluation on the American Community Survey (ACS) and ADULT datasets, which shows that our method outperforms state-of-the-art methods. Furthermore, PMW^Pub scales well to high-dimensional data domains, where running many existing methods would be computationally infeasible.

翻译：在许多统计问题中,将先期数据纳入前期数据可以大大改善业绩。然而,尽管以前通常以公共数据集的形式提供,例如美国以前的人口普查发布,但先前在不同私人查询发布方面的知识的使用仍然没有得到充分利用。为了公布关于私人数据集的统计数据,我们介绍了PMW ⁇ Pub, 与现有的基线不同,它利用从相关分发中获取的公共数据作为先前的信息。我们提供了关于美国社区调查(ACS)和ADUT数据集的理论分析和实证评估,这表明我们的方法优于最新方法。此外,PMW ⁇ Pub尺度也很好地适用于高维数据领域,而许多现有方法的运行在计算上是行不通的。

0

相关内容

统计量

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

专知会员服务

10+阅读 · 2020年5月14日

【SIGMOD2020】知识图谱补全方法的现实再评价，Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study

【SIGMOD2020】知识图谱补全方法的现实再评价，Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study

专知会员服务

33+阅读 · 2020年3月23日

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

专知会员服务

36+阅读 · 2020年3月19日

【KDD2019|讲座推荐】公平意识机器学习：现实挑战与经验教训：Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

专知会员服务

20+阅读 · 2019年12月9日

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

专知会员服务

17+阅读 · 2019年12月9日

【KDD2019|讲座推荐】在线控制实验结果评估的挑战、最佳实践和陷阱：Challenges, Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments

【KDD2019|讲座推荐】在线控制实验结果评估的挑战、最佳实践和陷阱：Challenges, Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments

专知会员服务

4+阅读 · 2019年12月4日

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

专知会员服务

15+阅读 · 2019年11月18日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

已删除

将门创投

3+阅读 · 2019年9月4日

计算机 | 中低难度国际会议信息8条

计算机 | 中低难度国际会议信息8条

Call4Papers

9+阅读 · 2019年6月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF C类 | DSAA 2019 诚邀稿件

CCF C类 | DSAA 2019 诚邀稿件

Call4Papers

6+阅读 · 2019年5月13日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

A Distance Measure for Privacy-preserving Process Mining based on Feature Learning

A Distance Measure for Privacy-preserving Process Mining based on Feature Learning

Arxiv

0+阅读 · 2021年8月10日

Synthetic Benchmarks for Scientific Research in Explainable Machine Learning

Arxiv

0+阅读 · 2021年8月6日

Mitigating dataset harms requires stewardship: Lessons from 1000 papers

Arxiv

0+阅读 · 2021年8月6日

Differentially Private n-gram Extraction

Arxiv

1+阅读 · 2021年8月5日

Uncertainty-Aware Reliable Text Classification

Arxiv

8+阅读 · 2021年7月15日

Recent Advances in Large Margin Learning

Arxiv

12+阅读 · 2021年3月25日

MMKG: Multi-Modal Knowledge Graphs

Arxiv

30+阅读 · 2019年3月13日

Efficient Parameter-free Clustering Using First Neighbor Relations

Efficient Parameter-free Clustering Using First Neighbor Relations

Arxiv

7+阅读 · 2019年2月28日

Data augmentation using learned transforms for one-shot medical image segmentation

Arxiv

5+阅读 · 2019年2月25日

DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications

Arxiv

4+阅读 · 2017年11月15日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

专知会员服务

10+阅读 · 2020年5月14日

【SIGMOD2020】知识图谱补全方法的现实再评价，Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study

【SIGMOD2020】知识图谱补全方法的现实再评价，Realistic Re-evaluation of Knowledge Graph Completion Methods: An Experimental Study

专知会员服务

33+阅读 · 2020年3月23日

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

专知会员服务

36+阅读 · 2020年3月19日

【KDD2019|讲座推荐】公平意识机器学习：现实挑战与经验教训：Fairness-Aware Machine Learning: Practical Challenges and Lessons Learned

专知会员服务

20+阅读 · 2019年12月9日

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

【KDD2019|讲座推荐】现代MDL与数据挖掘的结合--洞察力、理论和实践：Modern MDL meets Data Mining -- Insights, Theory, and Practice

专知会员服务

17+阅读 · 2019年12月9日

【KDD2019|讲座推荐】在线控制实验结果评估的挑战、最佳实践和陷阱：Challenges, Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments

【KDD2019|讲座推荐】在线控制实验结果评估的挑战、最佳实践和陷阱：Challenges, Best Practices and Pitfalls in Evaluating Results of Online Controlled Experiments

专知会员服务

4+阅读 · 2019年12月4日

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

【AAAI Tutorials 2019】联合学习：机器学习中的用户隐私，数据安全性和机密性（Federated Learning: User Privacy, Data Security and Confidentiality in Machine Learning）

专知会员服务

15+阅读 · 2019年11月18日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【新书】《知识图谱与大语言模型的协同应用》，544页pdf

军事通信系统：安全行动的支柱

《缓解大语言模型（LLMs）幻觉：面向应用的检索增强生成（RAG）、推理与智能体系统综述》

【新书】机器学习系统，2620页pdf

相关资讯

已删除

将门创投

3+阅读 · 2019年9月4日

计算机 | 中低难度国际会议信息8条

计算机 | 中低难度国际会议信息8条

Call4Papers

9+阅读 · 2019年6月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

CCF C类 | DSAA 2019 诚邀稿件

CCF C类 | DSAA 2019 诚邀稿件

Call4Papers

6+阅读 · 2019年5月13日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

相关论文

A Distance Measure for Privacy-preserving Process Mining based on Feature Learning

A Distance Measure for Privacy-preserving Process Mining based on Feature Learning

Arxiv

0+阅读 · 2021年8月10日

Synthetic Benchmarks for Scientific Research in Explainable Machine Learning

Arxiv

0+阅读 · 2021年8月6日

Mitigating dataset harms requires stewardship: Lessons from 1000 papers

Arxiv

0+阅读 · 2021年8月6日

Differentially Private n-gram Extraction

Arxiv

1+阅读 · 2021年8月5日

Uncertainty-Aware Reliable Text Classification

Arxiv

8+阅读 · 2021年7月15日

Recent Advances in Large Margin Learning

Arxiv

12+阅读 · 2021年3月25日

MMKG: Multi-Modal Knowledge Graphs

Arxiv

30+阅读 · 2019年3月13日

Efficient Parameter-free Clustering Using First Neighbor Relations

Efficient Parameter-free Clustering Using First Neighbor Relations

Arxiv

7+阅读 · 2019年2月28日

Data augmentation using learned transforms for one-shot medical image segmentation

Arxiv

5+阅读 · 2019年2月25日

DuReader: a Chinese Machine Reading Comprehension Dataset from Real-world Applications

Arxiv

4+阅读 · 2017年11月15日

微信扫码咨询专知VIP会员