Federated Learning (FL) is a scheme for collaboratively training Deep Neural Networks (DNNs) with multiple data sources from different clients. Instead of sharing the data, each client trains the model locally, resulting in improved privacy. However, recently so-called targeted poisoning attacks have been proposed that allow individual clients to inject a backdoor into the trained model. Existing defenses against these backdoor attacks either rely on techniques like Differential Privacy to mitigate the backdoor, or analyze the weights of the individual models and apply outlier detection methods that restricts these defenses to certain data distributions. However, adding noise to the models' parameters or excluding benign outliers might also reduce the accuracy of the collaboratively trained model. Additionally, allowing the server to inspect the clients' models creates a privacy risk due to existing knowledge extraction methods. We propose \textit{CrowdGuard}, a model filtering defense, that mitigates backdoor attacks by leveraging the clients' data to analyze the individual models before the aggregation. To prevent data leaks, the server sends the individual models to secure enclaves, running in client-located Trusted Execution Environments. To effectively distinguish benign and poisoned models, even if the data of different clients are not independently and identically distributed (non-IID), we introduce a novel metric called \textit{HLBIM} to analyze the outputs of the DNN's hidden layers. We show that the applied significance-based detection algorithm combined can effectively detect poisoned models, even in non-IID scenarios.
翻译:联邦学习联合会(FL) 是一个合作培训深神经网络(DNN) 的计划, 由来自不同客户的多个数据源提供多种数据源。 但是, 每位客户不是共享数据, 而是在当地培训模型, 从而改善隐私。 然而, 最近提出了所谓的定点中毒袭击, 让个别客户将后门注入经过培训的模型。 针对这些幕后攻击的现有防御要么依靠差异隐私等技术来缓解后门攻击, 要么分析单个模型的重量, 并应用外部检测方法, 将这些防御限制在某些数据分布上。 但是, 增加模型参数的噪音或排除良性外端点, 反而会降低经过协作训练的模式的准确性。 此外, 允许服务器检查客户模式会因现有的知识提取方法而产生隐私风险。 我们提议了\ textitleitit{CrowdGuard}, 一种过滤防御模式, 通过利用客户的数据在汇总前分析单个模型, 来减轻后门攻击。 为了防止数据泄漏, 服务器将单个模型发送到安全的飞地,, 甚至将客户定位为不固定的客户端执行 D- II 版本, 如果 独立地展示了我们所定义的 D 数据库,, 可以有效地区分了不同的数据和新版本。