大型国家劳工计划模式的高效DP-SGD机制 (An Efficient DP-SGD Mechanism for Large Scale NLP Models)

Recent advances in deep learning have drastically improved performance on many Natural Language Understanding (NLU) tasks. However, the data used to train NLU models may contain private information such as addresses or phone numbers, particularly when drawn from human subjects. It is desirable that underlying models do not expose private information contained in the training data. Differentially Private Stochastic Gradient Descent (DP-SGD) has been proposed as a mechanism to build privacy-preserving models. However, DP-SGD can be prohibitively slow to train. In this work, we propose a more efficient DP-SGD for training using a GPU infrastructure and apply it to fine-tuning models based on LSTM and transformer architectures. We report faster training times, alongside accuracy, theoretical privacy guarantees and success of Membership inference attacks for our models and observe that fine-tuning with proposed variant of DP-SGD can yield competitive models without significant degradation in training time and improvement in privacy protection. We also make observations such as looser theoretical $\epsilon, \delta$ can translate into significant practical privacy gains.

翻译：最近深层次学习的进展大大改善了许多自然语言理解(NLU)任务的业绩,然而,用于培训NLU模型的数据可能包含私人信息,如地址或电话号码等,特别是从人的主题中提取的数据; 基础模型最好不暴露培训数据中所包含的私人信息; 提出了不同的私人小动物渐长后裔(DP-SGD),作为建立隐私保护模式的机制; 然而,DP-SGD培训的速度可能过于缓慢。在这项工作中,我们提议使用GPU基础设施进行培训时采用效率更高的DP-SGD,并将其应用到基于LSTM和变异结构的微调模型中。我们报告培训时间加快,加上准确性、理论隐私保障和会员身份攻击成功,我们发现对DP-SGD的拟议变型的微调可以产生竞争性模型,而不会显著降低培训时间和改进隐私保护。我们还提出一些意见,例如较宽松的理论 $\epslon,\delta$可以转化为显著的实际隐私收益。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

【CVPR2021】自监督几何感知

专知会员服务

46+阅读 · 2021年3月6日

最新《Transformers模型》教程，64页ppt

专知会员服务

321+阅读 · 2020年11月26日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

自然语言处理中的注意力机制，Attention in Natural Language Processing

专知会员服务

136+阅读 · 2020年5月30日