以实用方法处理保护隐私的文本生成问题 (On a Utilitarian Approach to Privacy Preserving Text Generation) - 专知论文

会员服务 ·

0

近邻 · 噪声 · 原点 · 约束优化 · state-of-the-art ·

2021 年 4 月 23 日

On a Utilitarian Approach to Privacy Preserving Text Generation

翻译：以实用方法处理保护隐私的文本生成问题

Zekun Xu,Abhinav Aggarwal,Oluwaseyi Feyisetan,Nathanael Teissier

from arxiv, 10 pages, 3 figures

Differentially-private mechanisms for text generation typically add carefully calibrated noise to input words and use the nearest neighbor to the noised input as the output word. When the noise is small in magnitude, these mechanisms are susceptible to reconstruction of the original sensitive text. This is because the nearest neighbor to the noised input is likely to be the original input. To mitigate this empirical privacy risk, we propose a novel class of differentially private mechanisms that parameterizes the nearest neighbor selection criterion in traditional mechanisms. Motivated by Vickrey auction, where only the second highest price is revealed and the highest price is kept private, we balance the choice between the first and the second nearest neighbors in the proposed class of mechanisms using a tuning parameter. This parameter is selected by empirically solving a constrained optimization problem for maximizing utility, while maintaining the desired privacy guarantees. We argue that this empirical measurement framework can be used to align different mechanisms along a common benchmark for their privacy-utility tradeoff, particularly when different distance metrics are used to calibrate the amount of noise added. Our experiments on real text classification datasets show up to 50% improvement in utility compared to the existing state-of-the-art with the same empirical privacy guarantee.

翻译：用于文本生成的不同私人机制通常会为输入单词添加经过仔细校准的噪音,并将最近的邻居作为输出单词使用。当噪声数量小时, 这些机制很容易重塑原始敏感文本。这是因为最近的独家机制很可能是原始输入的原始输入。为了减轻这种经验隐私风险, 我们提议了一种新的不同私人机制类别, 将最近邻居的选择标准在传统机制中参数化。受Vickrey 拍卖的驱动, 只有第二高的价格被披露, 最高的价格被保密, 我们平衡了拟议机制类别中第一个和第二近邻的选择, 使用调频参数。这个参数是通过实验性地解决一个有限的优化问题, 以最大限度地发挥效用, 同时维护理想的隐私保障。我们主张, 这个经验性衡量框架可以用来将不同机制与隐私使用的共同基准相匹配, 特别是当使用不同的距离测量来校准噪声量时, 我们关于实际文本分类数据集的实验显示, 与现有的国家隐私保障相比, 效用改善到50% 。

0

相关内容

深度学习模型可解释性的研究进展

专知会员服务

223+阅读 · 2020年8月1日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

专知会员服务

36+阅读 · 2020年3月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

计算机视觉最佳实践、代码示例和相关文档

计算机视觉最佳实践、代码示例和相关文档

专知会员服务

20+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

已删除

将门创投

7+阅读 · 2018年11月5日

Differentially Private Quantiles

Differentially Private Quantiles

Arxiv

0+阅读 · 2021年6月15日

On Polynomial Approximations for Privacy-Preserving and Verifiable ReLU Networks

Arxiv

0+阅读 · 2021年6月15日

Budget Sharing for Multi-Analyst Differential Privacy

Arxiv

0+阅读 · 2021年6月12日

Generalized Moving Peaks Benchmark

Arxiv

0+阅读 · 2021年6月11日

GraphMI: Extracting Private Graph Data from Graph Neural Networks

Arxiv

1+阅读 · 2021年6月5日

Privacy-Preserving Graph Convolutional Networks for Text Classification

Arxiv

0+阅读 · 2021年2月10日

FedGNN: Federated Graph Neural Network for Privacy-Preserving Recommendation

Arxiv

5+阅读 · 2021年2月9日

Adversarial Mutual Information for Text Generation

Adversarial Mutual Information for Text Generation

Arxiv

13+阅读 · 2020年6月30日

A generic framework for privacy preserving deep learning

Arxiv

6+阅读 · 2018年11月13日

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning

Arxiv

4+阅读 · 2017年10月26日

VIP会员

文章信息

相关主题

state-of-the-art

相关VIP内容

深度学习模型可解释性的研究进展

专知会员服务

223+阅读 · 2020年8月1日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【ACL2020】对抗性文本生成，Improving Adversarial Text Generation

专知会员服务

52+阅读 · 2020年5月5日

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

【论文推荐】保护隐私的协同过滤综述，Survey of Privacy-Preserving Collaborative Filtering

专知会员服务

36+阅读 · 2020年3月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

计算机视觉最佳实践、代码示例和相关文档

计算机视觉最佳实践、代码示例和相关文档

专知会员服务

20+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

【ICCV2025教程】基础模型遇见具身智能体

军事机器学习设计：关于开发自动化任务摘要系统的梯次化设计科学研究 | 2025最新93页

扩散模型中的缓存方法综述：迈向高效的多模态生成

【ICCV2025教程】《迈向视觉语言模型的全面推理》

相关资讯

已删除

将门创投

7+阅读 · 2018年11月5日

相关论文

Differentially Private Quantiles

Differentially Private Quantiles

Arxiv

0+阅读 · 2021年6月15日

On Polynomial Approximations for Privacy-Preserving and Verifiable ReLU Networks

Arxiv

0+阅读 · 2021年6月15日

Budget Sharing for Multi-Analyst Differential Privacy

Arxiv

0+阅读 · 2021年6月12日

Generalized Moving Peaks Benchmark

Arxiv

0+阅读 · 2021年6月11日

GraphMI: Extracting Private Graph Data from Graph Neural Networks

Arxiv

1+阅读 · 2021年6月5日

Privacy-Preserving Graph Convolutional Networks for Text Classification

Arxiv

0+阅读 · 2021年2月10日

FedGNN: Federated Graph Neural Network for Privacy-Preserving Recommendation

Arxiv

5+阅读 · 2021年2月9日

Adversarial Mutual Information for Text Generation

Adversarial Mutual Information for Text Generation

Arxiv

13+阅读 · 2020年6月30日

A generic framework for privacy preserving deep learning

Arxiv

6+阅读 · 2018年11月13日

A Unified approach for Conventional Zero-shot, Generalized Zero-shot and Few-shot Learning

Arxiv

4+阅读 · 2017年10月26日

微信扫码咨询专知VIP会员