Recent studies have shown that deep neural networks (DNNs) are vulnerable to adversarial attacks, including evasion and backdoor (poisoning) attacks. On the defense side, there have been intensive efforts to improve both empirical and provable robustness against evasion attacks; however, provable robustness against backdoor attacks remains largely unexplored. In this paper, we focus on certifying machine learning model robustness against general threat models, especially backdoor attacks. We first provide a unified framework via randomized smoothing techniques and show how it can be instantiated to certify robustness against both evasion and backdoor attacks. We then propose the first robust training process, RAB, to smooth the trained model and certify its robustness against backdoor attacks. We prove the robustness bound for machine learning models trained with RAB and show that this bound is tight. In addition, we theoretically show that it is possible to train robust smoothed models efficiently for simple models such as K-nearest neighbor (K-NN) classifiers, and we propose an exact smooth-training algorithm that eliminates the need to sample from a noise distribution for such models. Empirically, we conduct comprehensive experiments on different machine learning (ML) models such as DNNs, support vector machines, and K-NN models on the MNIST, CIFAR-10, and ImageNette datasets, and provide the first benchmark for certified robustness against backdoor attacks. In addition, we evaluate K-NN models on the spambase tabular dataset to demonstrate the advantages of the proposed exact algorithm. Both the theoretical analysis and the comprehensive evaluation on diverse ML models and datasets shed light on further robust learning strategies against general training-time attacks.
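The randomized smoothing idea underlying the unified framework can be sketched as follows: a smoothed classifier predicts the class most likely under random perturbations of the input, i.e. g(x) = argmax_c P[f(x + ε) = c] with ε ~ N(0, σ²I), estimated by Monte Carlo voting. This is a minimal illustrative sketch, not the paper's RAB training procedure; `smoothed_predict` and the toy base classifier are hypothetical names introduced here.

```python
import numpy as np

def smoothed_predict(base_classifier, x, sigma=0.5, n_samples=1000, rng=None):
    """Monte Carlo estimate of the smoothed prediction
    g(x) = argmax_c P[f(x + eps) = c], with eps ~ N(0, sigma^2 I)."""
    rng = np.random.default_rng(0) if rng is None else rng
    votes = {}
    for _ in range(n_samples):
        noisy = x + rng.normal(0.0, sigma, size=x.shape)
        label = base_classifier(noisy)  # query the base model on a noisy copy
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)  # majority vote over noise samples

# Toy 1-D decision rule standing in for a trained model:
# predict class 1 iff the coordinate sum is positive.
f = lambda x: int(x.sum() > 0)
print(smoothed_predict(f, np.array([2.0, 1.0])))  # → 1 (point far from the boundary)
```

The margin by which the majority class wins the vote is what the certification machinery converts into a robustness radius; RAB extends this idea from test-time noise to smoothing over perturbed training sets.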