差异私人加权抽样 (Differentially Private Weighted Sampling) - 专知论文

会员服务 ·

0

Weight · Performer · 估计/估计量 · 样本 · 统计量 ·

2021 年 3 月 31 日

Differentially Private Weighted Sampling

翻译：差异私人加权抽样

Edith Cohen,Ofir Geri,Tamas Sarlos,Uri Stemmer

from arxiv, 38 pages, 9 figures

Common datasets have the form of elements with keys (e.g., transactions and products) and the goal is to perform analytics on the aggregated form of key and frequency pairs. A weighted sample of keys by (a function of) frequency is a highly versatile summary that provides a sparse set of representative keys and supports approximate evaluations of query statistics. We propose private weighted sampling (PWS): A method that ensures element-level differential privacy while retaining, to the extent possible, the utility of a respective non-private weighted sample. PWS maximizes the reporting probabilities of keys and estimation quality of a broad family of statistics. PWS improves over the state of the art also for the well-studied special case of private histograms, when no sampling is performed. We empirically demonstrate significant performance gains compared with prior baselines: 20%-300% increase in key reporting for common Zipfian frequency distributions and accuracy for $\times 2$-$ 8$ lower frequencies in estimation tasks. Moreover, PWS is applied as a simple post-processing of a non-private sample, without requiring the original data. This allows for seamless integration with existing implementations of non-private schemes and retaining the efficiency of schemes designed for resource-constrained settings such as massive distributed or streamed data. We believe that due to practicality and performance, PWS may become a method of choice in applications where privacy is desired.

翻译：通用数据集具有关键要素的形式(如交易和产品),目标是对关键和频对组合的组合形式进行分析。按(函数)频率加权键样本是一个高度多功能的概要,提供一套稀少的代表性关键,支持对查询统计的近似评价。我们建议私人加权抽样:一种确保元素级差异隐私的方法,同时尽可能保留相关非私人加权样本的实用性。PWS尽量扩大广泛统计系列的关键和估计质量的可靠性和估计性能报告概率。PWS还改进了对精细研究的私人直方图特殊案例的先进程度,而没有进行抽样。我们从经验上表明,与以往基线相比,业绩有显著提高:通用Zipfian频率分布和准确度的主要报告增加了20%至300%,同时在估算任务中保留了2美元至8美元的低频率。此外,PWSS是作为非私人抽样的简单后处理,不需要进行精细的私人直图特殊案例,在不进行抽样时,在不进行取样的情况下,在不进行抽样调查的情况下,我们可以将现有的数据流流化的方法纳入。

0

相关内容

Weight

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

专知会员服务

36+阅读 · 2019年11月15日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

已删除

将门创投

4+阅读 · 2019年9月10日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

Topological Pilot Assignment in Large-Scale Distributed MIMO Networks

Arxiv

0+阅读 · 2021年5月26日

Differentially Private Frequency Moments Estimation with Polylogarithmic Space

Arxiv

0+阅读 · 2021年5月26日

A priori and a posteriori error analysis of an unfitted HDG method for semi-linear elliptic problems

Arxiv

0+阅读 · 2021年5月26日

Design of phase III trials with long-term survival outcomes based on short-term binary results

Arxiv

0+阅读 · 2021年5月24日

Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme

Arxiv

0+阅读 · 2021年5月23日

Estimating leverage scores via rank revealing methods and randomization

Arxiv

0+阅读 · 2021年5月23日

Privacy Amplification Via Bernoulli Sampling

Arxiv

0+阅读 · 2021年5月21日

Clustered Sampling: Low-Variance and Improved Representativity for Clients Selection in Federated Learning

Arxiv

0+阅读 · 2021年5月21日

LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy

Arxiv

0+阅读 · 2021年5月21日

Federated Model Distillation with Noise-Free Differential Privacy

Arxiv

0+阅读 · 2021年5月21日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

《算法凸几何》简明书，Algorithmic Convex Geometry，50页pdf

专知会员服务

42+阅读 · 2021年4月2日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

知识图谱推理，50页ppt，Salesforce首席科学家Richard Socher

专知会员服务

111+阅读 · 2020年6月10日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

【AAAI2020论文】隐私保留GBDT（Privacy-Preserving Gradient Boosting Decision Trees）

专知会员服务

36+阅读 · 2019年11月15日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【NeurIPS2025教程】人类–AI 对齐：基础、方法、实践与挑战

中文版《未来战争：杀伤链优势与俄乌战争启示》报告

中国信通院规划所发布《人工智能算力基础设施赋能研究报告（2025年）》

人机编队将赢得未来战争

相关资讯

已删除

将门创投

4+阅读 · 2019年9月10日

动物脑的好奇心和强化学习的好奇心

动物脑的好奇心和强化学习的好奇心

CreateAMind

10+阅读 · 2019年1月26日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】免费书(草稿)：数据科学的数学基础

【推荐】免费书(草稿)：数据科学的数学基础

机器学习研究会

20+阅读 · 2017年10月1日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

相关论文

Topological Pilot Assignment in Large-Scale Distributed MIMO Networks

Arxiv

0+阅读 · 2021年5月26日

Differentially Private Frequency Moments Estimation with Polylogarithmic Space

Arxiv

0+阅读 · 2021年5月26日

A priori and a posteriori error analysis of an unfitted HDG method for semi-linear elliptic problems

Arxiv

0+阅读 · 2021年5月26日

Design of phase III trials with long-term survival outcomes based on short-term binary results

Arxiv

0+阅读 · 2021年5月24日

Quantile Multi-Armed Bandits: Optimal Best-Arm Identification and a Differentially Private Scheme

Arxiv

0+阅读 · 2021年5月23日

Estimating leverage scores via rank revealing methods and randomization

Arxiv

0+阅读 · 2021年5月23日

Privacy Amplification Via Bernoulli Sampling

Arxiv

0+阅读 · 2021年5月21日

Clustered Sampling: Low-Variance and Improved Representativity for Clients Selection in Federated Learning

Arxiv

0+阅读 · 2021年5月21日

LDP-FL: Practical Private Aggregation in Federated Learning with Local Differential Privacy

Arxiv

0+阅读 · 2021年5月21日

Federated Model Distillation with Noise-Free Differential Privacy

Arxiv

0+阅读 · 2021年5月21日

微信扫码咨询专知VIP会员