Due to cost and time-to-market constraints, many industries outsource the training process of machine learning models (ML) to third-party cloud service providers, popularly known as ML-asa-Service (MLaaS). MLaaS creates opportunity for an adversary to provide users with backdoored ML models to produce incorrect predictions only in extremely rare (attacker-chosen) scenarios. Bayesian neural networks (BNN) are inherently immune against backdoor attacks since the weights are designed to be marginal distributions to quantify the uncertainty. In this paper, we propose a novel backdoor attack based on effective learning and targeted utilization of reverse distribution. This paper makes three important contributions. (1) To the best of our knowledge, this is the first backdoor attack that can effectively break the robustness of BNNs. (2) We produce reverse distributions to cancel the original distributions when the trigger is activated. (3) We propose an efficient solution for merging probability distributions in BNNs. Experimental results on diverse benchmark datasets demonstrate that our proposed attack can achieve the attack success rate (ASR) of 100%, while the ASR of the state-of-the-art attacks is lower than 60%.
翻译:由于成本和时间到市场的限制,许多行业将机器学习模式(ML)的培训过程外包给第三方云服务供应商,通称为ML-asa-Service(MLaaaS)。MLaaS为对手提供机会,让用户获得后门ML模型,只在极为罕见的情况下(攻击者选择)才能作出不正确的预测。Bayesian神经网络(BNN)本质上不受后门攻击的影响,因为重量设计是用来量化不确定性的边缘分布。在本文中,我们提议以有效学习和有针对性地利用反向分布为基础,进行新的后门攻击。本文作出了三项重要贡献。(1) 据我们所知,这是第一次能够有效打破BNS坚固度的后门攻击。(2)我们制作反向分布,以便在启动触发点时取消原始分布。(3)我们提议一个有效的解决方案,将BNUS的概率分布合并起来。不同基准数据集的实验结果表明,我们提出的攻击可以达到100%的攻击成功率(ASR),而ASR是州攻击率低于60。