RecUP-FL: 通过用户可配置隐私防御方式在联邦学习中协调效用与隐私 (RecUP-FL: Reconciling Utility and Privacy in Federated Learning via User-configurable Privacy Defense)

Federated learning (FL) provides a variety of privacy advantages by allowing clients to collaboratively train a model without sharing their private data. However, recent studies have shown that private information can still be leaked through shared gradients. To further minimize the risk of privacy leakage, existing defenses usually require clients to locally modify their gradients (e.g., differential privacy) prior to sharing with the server. While these approaches are effective in certain cases, they regard the entire data as a single entity to protect, which usually comes at a large cost in model utility. In this paper, we seek to reconcile utility and privacy in FL by proposing a user-configurable privacy defense, RecUP-FL, that can better focus on the user-specified sensitive attributes while obtaining significant improvements in utility over traditional defenses. Moreover, we observe that existing inference attacks often rely on a machine learning model to extract the private information (e.g., attributes). We thus formulate such a privacy defense as an adversarial learning problem, where RecUP-FL generates slight perturbations that can be added to the gradients before sharing to fool adversary models. To improve the transferability to un-queryable black-box adversary models, inspired by the idea of meta-learning, RecUP-FL forms a model zoo containing a set of substitute models and iteratively alternates between simulations of the white-box and the black-box adversarial attack scenarios to generate perturbations. Extensive experiments on four datasets under various adversarial settings (both attribute inference attack and data reconstruction attack) show that RecUP-FL can meet user-specified privacy constraints over the sensitive attributes while significantly improving the model utility compared with state-of-the-art privacy defenses.

翻译：联邦学习提供了一种通过协作训练模型而不分享私密数据的方法，从而提供了各种隐私优势。然而，最近的研究表明，虽然不共享数据，但通过共享梯度还是可能泄漏私有信息。为了进一步最小化隐私泄露的风险，现有的防御通常需要客户端在与服务器共享梯度之前局部修改其梯度（如差分隐私）。虽然这些方法在某些情况下非常有效，但它们视整个数据为一项要保护的单个实体，这通常会以模型效用的大量成本为代价。在这篇文章中，我们通过提出一种名为RecUP-FL的用户可配置隐私防御方法来在联邦学习中协调效用与隐私，该方法可以更好地关注用户指定的敏感属性，同时在传统防御方案的基础上显著提高效用。此外，我们观察到现有的推理攻击通常依赖于机器学习模型来提取私有信息（例如属性）。因此，我们将此类隐私防御形式化为对抗性学习问题，其中RecUP-FL生成微小的扰动，可在共享前添加到梯度中以欺骗恶意模型。为了提高对不可查询黑盒对手模型的可转移性，灵感来自元学习的思想，RecUP-FL形成了一个包含一组替代模型的模型动物园，并在模拟白盒和黑盒对抗攻击场景之间交替进行来生成扰动。在各种对抗设置下（包括属性推断攻击和数据重建攻击）的四个数据集上进行的广泛实验表明，与现有的最先进隐私防御相比，RecUP-FL能够满足用户指定的敏感属性隐私约束，同时显著提高模型效用。