We present a Federated Learning (FL) based solution for building a distributed classifier capable of detecting URLs containing GDPR-sensitive content related to categories such as health, sexual preference, political beliefs, etc. Although such a classifier addresses the limitations of previous offline/centralised classifiers,it is still vulnerable to poisoning attacks from malicious users that may attempt to reduce the accuracy for benign users by disseminating faulty model updates. To guard against this, we develop a robust aggregation scheme based on subjective logic and residual-based attack detection. Employing a combination of theoretical analysis, trace-driven simulation, as well as experimental validation with a prototype and real users, we show that our classifier can detect sensitive content with high accuracy, learn new labels fast, and remain robust in view of poisoning attacks from malicious users, as well as imperfect input from non-malicious ones.
翻译:我们提出了一个基于联邦学习(FL)的解决方案,用于建立一个分布式分类器,能够检测含有与健康、性偏好、政治信仰等类别相关的GDPR敏感内容的URL的分布式分类器。 虽然这种分类器解决了先前的离线/集中分类器的局限性,但仍然容易受到恶意用户的毒害袭击,这些袭击可能通过传播错误的模型更新来降低良性用户的准确性。为了防范这种情况,我们根据主观逻辑和残余袭击检测,制定了一个强有力的汇总计划。我们采用了理论分析、追踪模拟以及实验验证与原型和真实用户的结合,我们表明我们的分类器能够以高精度检测敏感内容,快速学习新标签,并鉴于恶意用户的中毒袭击以及非恶意用户的不完善投入,保持稳健。