渐变性攻击让联邦学习变得不安全吗? (Do Gradient Inversion Attacks Make Federated Learning Unsafe?)

Federated learning (FL) allows the collaborative training of AI models without needing to share raw data. This capability makes it especially interesting for healthcare applications where patient and data privacy is of utmost concern. However, recent works on the inversion of deep neural networks from model gradients raised concerns about the security of FL in preventing the leakage of training data. In this work, we show that these attacks presented in the literature are impractical in real FL use-cases and provide a new baseline attack that works for more realistic scenarios where the clients' training involves updating the Batch Normalization (BN) statistics. Furthermore, we present new ways to measure and visualize potential data leakage in FL. Our work is a step towards establishing reproducible methods of measuring data leakage in FL and could help determine the optimal tradeoffs between privacy-preserving techniques, such as differential privacy, and model accuracy based on quantifiable metrics.

翻译：联邦学习(FL)允许在无需分享原始数据的情况下对AI模型进行合作培训,这种能力使得在病人和数据隐私最令人关切的保健应用方面特别有趣;然而,最近关于将深神经网络从模型梯度中倒转的工作引起了人们对FL在防止培训数据泄漏方面的安全性的关切;在这项工作中,我们表明,文献中介绍的这些攻击在实际FL使用案例中是不切实际的,并提供新的基线攻击,在客户培训涉及更新批次正常化(BN)统计数据的更现实的情景下发挥作用;此外,我们提出了衡量和可视化FL中潜在数据渗漏的新方法。我们的工作是朝着建立可复制的测量FL数据渗漏的方法迈出的一步,有助于确定隐私保护技术(如差异隐私)和基于量化指标的模型精度之间的最佳权衡。

相关内容

联邦学习

关注 200

联邦学习（Federated Learning）是一种新兴的人工智能基础技术，在 2016 年由谷歌最先提出，原本用于解决安卓手机终端用户在本地更新模型的问题，其设计目标是在保障大数据交换时的信息安全、保护终端数据和个人数据隐私、保证合法合规的前提下，在多参与方或多计算结点之间开展高效率的机器学习。其中，联邦学习可使用的机器学习算法不局限于神经网络，还包括随机森林等重要算法。联邦学习有望成为下一代人工智能协同算法和协作网络的基础。

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日