Privacy preservation remains a key challenge in data mining and Natural Language Understanding (NLU). Previous research shows that input text, and even text embeddings, can leak private information. This concern motivates our research on effective privacy preservation approaches for pretrained Language Models (LMs). We investigate the privacy and utility implications of applying dχ-privacy, a variant of Local Differential Privacy, to BERT fine-tuning in NLU applications. More importantly, we propose privacy-adaptive LM pretraining methods and show that they can substantially improve the utility of BERT while retaining the same level of privacy protection. We also quantify the level of privacy preservation and provide guidance on privacy configuration. Our experiments and findings lay the groundwork for future explorations of privacy-preserving NLU with pretrained LMs.
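For context, a minimal sketch of a dχ-privacy text perturbation mechanism commonly used in this line of work (noise added to a word's embedding, then decoded to the nearest vocabulary word, so only the perturbed word leaves the client) is shown below. The vocabulary, embeddings, and function name here are hypothetical toy stand-ins, not the paper's actual setup, which applies perturbation within BERT's representation pipeline.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy vocabulary with random 50-d embeddings (for illustration only).
vocab = ["good", "great", "bad", "movie", "film"]
emb = {w: rng.normal(size=50) for w in vocab}

def dx_privatize(word: str, epsilon: float) -> str:
    """Release a privatized replacement for `word` under dχ-privacy."""
    v = emb[word]
    d = v.shape[0]
    # Sample noise with density proportional to exp(-epsilon * ||z||):
    # a uniform direction on the unit sphere, scaled by a Gamma(d, 1/epsilon) radius.
    direction = rng.normal(size=d)
    direction /= np.linalg.norm(direction)
    radius = rng.gamma(shape=d, scale=1.0 / epsilon)
    noisy = v + radius * direction
    # Decode to the nearest vocabulary embedding; only this word is released.
    return min(vocab, key=lambda w: np.linalg.norm(emb[w] - noisy))

print(dx_privatize("good", epsilon=10.0))  # smaller epsilon -> noisier, more private output
```

Note that the privacy guarantee scales with the distance between embeddings: words with nearby embeddings are harder to distinguish, which is what lets utility survive at moderate epsilon values.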