关闭大门:根据客户-深层层输出分析,检测联邦学习中的后门模式 (Close the Gate: Detecting Backdoored Models in Federated Learning based on Client-Side Deep Layer Output Analysis)

Federated Learning (FL) is a scheme for collaboratively training Deep Neural Networks (DNNs) with multiple data sources from different clients. Instead of sharing the data, each client trains the model locally, resulting in improved privacy. However, recently so-called targeted poisoning attacks have been proposed that allow individual clients to inject a backdoor into the trained model. Existing defenses against these backdoor attacks either rely on techniques like Differential Privacy to mitigate the backdoor, or analyze the weights of the individual models and apply outlier detection methods that restricts these defenses to certain data distributions. However, adding noise to the models' parameters or excluding benign outliers might also reduce the accuracy of the collaboratively trained model. Additionally, allowing the server to inspect the clients' models creates a privacy risk due to existing knowledge extraction methods. We propose \textit{CrowdGuard}, a model filtering defense, that mitigates backdoor attacks by leveraging the clients' data to analyze the individual models before the aggregation. To prevent data leaks, the server sends the individual models to secure enclaves, running in client-located Trusted Execution Environments. To effectively distinguish benign and poisoned models, even if the data of different clients are not independently and identically distributed (non-IID), we introduce a novel metric called \textit{HLBIM} to analyze the outputs of the DNN's hidden layers. We show that the applied significance-based detection algorithm combined can effectively detect poisoned models, even in non-IID scenarios.

翻译：联邦学习联合会(FL) 是一个合作培训深神经网络(DNN) 的计划, 由来自不同客户的多个数据源提供多种数据源。但是, 每位客户不是共享数据, 而是在当地培训模型, 从而改善隐私。然而, 最近提出了所谓的定点中毒袭击, 让个别客户将后门注入经过培训的模型。针对这些幕后攻击的现有防御要么依靠差异隐私等技术来缓解后门攻击, 要么分析单个模型的重量, 并应用外部检测方法, 将这些防御限制在某些数据分布上。但是, 增加模型参数的噪音或排除良性外端点, 反而会降低经过协作训练的模式的准确性。此外, 允许服务器检查客户模式会因现有的知识提取方法而产生隐私风险。我们提议了\ textitleitit{CrowdGuard}, 一种过滤防御模式, 通过利用客户的数据在汇总前分析单个模型, 来减轻后门攻击。为了防止数据泄漏, 服务器将单个模型发送到安全的飞地,, 甚至将客户定位为不固定的客户端执行 D- II 版本, 如果独立地展示了我们所定义的 D 数据库,, 可以有效地区分了不同的数据和新版本。

相关内容

MoDELS

关注 44

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日