Adversarial examples pose a security risk: they can alter the decisions of a machine learning classifier through slight input perturbations. Certified robustness has been proposed as a mitigation: given an input $x$, a classifier returns a prediction together with a radius and a provable guarantee that any perturbation of $x$ within this radius (e.g., under the $L_2$ norm) will not alter the prediction. In this work, we show that these guarantees can be invalidated by the rounding errors inherent in floating-point representation. We design a rounding search method that efficiently exploits this vulnerability to find adversarial examples within the certified radius. We show that the attack can be carried out against several linear classifiers with exact certifiable guarantees and against neural networks with ReLU activations whose certifiable guarantees are conservative. Our experiments demonstrate attack success rates of over 50% on random linear classifiers, up to 23.24% on the MNIST dataset for a linear SVM, and up to 15.83% on the MNIST dataset for a neural network whose certified radius was given by a verifier based on mixed integer programming. Finally, as a mitigation, we advocate the use of rounded interval arithmetic to account for rounding errors.
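To make the failure mode concrete, the following is a minimal illustrative sketch, not the paper's implementation. For a linear classifier $f(x) = w^\top x + b$, the exact certified $L_2$ radius at $x$ is $|f(x)| / \|w\|_2$ in real arithmetic. The toy routine below (all names, such as rounding_search, and all parameter choices are hypothetical) places a candidate just inside that radius along the worst-case direction and probes one-ulp floating-point neighbours for a flip of the float64-computed decision sign; it may or may not find a violation for a given random seed.

    import numpy as np

    rng = np.random.default_rng(0)

    def decision(w, b, x):
        # Float64 decision value; its sign is the predicted class.
        return float(np.dot(w, x) + b)

    def certified_radius(w, b, x):
        # Certified L2 radius |f(x)| / ||w||_2 of a linear classifier;
        # exact only in real arithmetic, here computed in float64.
        return abs(decision(w, b, x)) / float(np.linalg.norm(w))

    def rounding_search(w, b, x, shrink=1.0 - 1e-12, trials=10_000):
        # Toy "rounding search" (hypothetical, not the paper's algorithm):
        # start just inside the certified radius along the worst-case
        # direction, then nudge single coordinates by one ulp and look for
        # a sign flip of the float64 decision value.
        r = certified_radius(w, b, x)
        sign0 = np.sign(decision(w, b, x))
        direction = -sign0 * w / np.linalg.norm(w)
        base = x + shrink * r * direction
        for _ in range(trials):
            cand = base.copy()
            i = rng.integers(len(cand))
            cand[i] = np.nextafter(cand[i], np.inf if rng.random() < 0.5 else -np.inf)
            # The distance check is itself done in floating point, mirroring
            # what a consumer of the certificate would compute.
            if (np.linalg.norm(cand - x) <= r
                    and np.sign(decision(w, b, cand)) != sign0):
                return cand  # prediction flipped within the reported radius
        return None

    if __name__ == "__main__":
        d = 1000
        w, b = rng.standard_normal(d), 0.1
        x = rng.standard_normal(d)
        adv = rounding_search(w, b, x)
        print("certified guarantee violated:", adv is not None)

The sketch only illustrates the principle; the margin left by the shrink factor is so small that accumulated rounding error in the float64 dot product can exceed it, which is the gap the attack described above exploits.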