有效注册防卫,防止对图像分类者进行补补袭击 (Efficient Certified Defenses Against Patch Attacks on Image Classifiers) - 专知论文

会员服务 ·

0

图像分类器 · 稳健性 · Automator · Performer · MoDELS ·

2021 年 2 月 8 日

Efficient Certified Defenses Against Patch Attacks on Image Classifiers

翻译：有效注册防卫,防止对图像分类者进行补补袭击

Jan Hendrik Metzen,Maksym Yatsura

from arxiv, accepted at ICLR 2021

Adversarial patches pose a realistic threat model for physical world attacks on autonomous systems via their perception component. Autonomous systems in safety-critical domains such as automated driving should thus contain a fail-safe fallback component that combines certifiable robustness against patches with efficient inference while maintaining high performance on clean inputs. We propose BagCert, a novel combination of model architecture and certification procedure that allows efficient certification. We derive a loss that enables end-to-end optimization of certified robustness against patches of different sizes and locations. On CIFAR10, BagCert certifies 10.000 examples in 43 seconds on a single GPU and obtains 86% clean and 60% certified accuracy against 5x5 patches.

翻译：自动驾驶等安全关键领域的自主系统应包含一个故障安全后退部分,将可验证的稳健性与高效推断的补丁结合起来,同时保持高效的清洁投入的高效性能。我们建议采用BagCert,这是模型架构和认证程序的新型组合,可以有效认证。我们得出一个损失,可以对不同大小和地点的补丁进行端到端的经认证的稳健性优化。在CIFAR10上,BagCert在43秒内验证了单一GPU的10,000个实例,在5x5补丁中获得了86%的清洁和60%的认证准确性。

0

相关内容

图像分类器

图像分类器

【ICLR2021】神经元注意力蒸馏消除DNN中的后门触发器

【ICLR2021】神经元注意力蒸馏消除DNN中的后门触发器

专知会员服务

15+阅读 · 2021年1月31日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

专知会员服务

7+阅读 · 2020年5月4日

【领域对抗学习的低资源文本分类】Low-Resource Text Classification using Domain-Adversarial Learning

【领域对抗学习的低资源文本分类】Low-Resource Text Classification using Domain-Adversarial Learning

专知会员服务

23+阅读 · 2020年4月22日

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

专知会员服务

12+阅读 · 2020年4月16日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

GeoffreyHinton-ICML2020投稿论文-偏转对抗攻击 Deflecting Adversarial Attacks

GeoffreyHinton-ICML2020投稿论文-偏转对抗攻击 Deflecting Adversarial Attacks

专知会员服务

24+阅读 · 2020年2月22日

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

专知会员服务

11+阅读 · 2019年11月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

全球首例！Adversarial T-shirt让你在AI目标检测系统中隐身

全球首例！Adversarial T-shirt让你在AI目标检测系统中隐身

CVer

4+阅读 · 2020年7月7日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

谷歌开发EfficientNets，扩大CNN并与AutoML结合，效率提升10倍|一周AI最火论文

谷歌开发EfficientNets，扩大CNN并与AutoML结合，效率提升10倍|一周AI最火论文

大数据文摘

9+阅读 · 2019年6月4日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

CVPR 2019 | 图像压缩重建也能抵御对抗样本

CVPR 2019 | 图像压缩重建也能抵御对抗样本

计算机视觉life

3+阅读 · 2019年4月26日

计算机视觉的不同任务

计算机视觉的不同任务

专知

5+阅读 · 2018年8月27日

黑魔法防御术：Ian Goodfellow对抗样本研究现状与未来方向综述

黑魔法防御术：Ian Goodfellow对抗样本研究现状与未来方向综述

专知

29+阅读 · 2018年5月26日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

已删除

将门创投

7+阅读 · 2018年4月25日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

PatchGuard: A Provably Robust Defense against Adversarial Patches via Small Receptive Fields and Masking

Arxiv

0+阅读 · 2021年3月31日

How Robust are Randomized Smoothing based Defenses to Data Poisoning?

Arxiv

0+阅读 · 2021年3月30日

What Causes Optical Flow Networks to be Vulnerable to Physical Adversarial Attacks

Arxiv

0+阅读 · 2021年3月30日

Hidden Backdoor Attack against Semantic Segmentation Models

Arxiv

0+阅读 · 2021年3月30日

Optimal Transport as a Defense Against Adversarial Attacks

Arxiv

0+阅读 · 2021年3月30日

Lagrangian Objective Function Leads to Improved Unforeseen Attack Generalization in Adversarial Training

Arxiv

0+阅读 · 2021年3月29日

Understanding Robustness of Transformers for Image Classification

Arxiv

2+阅读 · 2021年3月26日

Composite Adversarial Attacks

Arxiv

12+阅读 · 2020年12月10日

A Baseline for Few-Shot Image Classification

Arxiv

7+阅读 · 2020年3月1日

DPatch: An Adversarial Patch Attack on Object Detectors

DPatch: An Adversarial Patch Attack on Object Detectors

Arxiv

4+阅读 · 2018年9月15日

VIP会员

文章信息

相关主题

图像分类器

相关VIP内容

【ICLR2021】神经元注意力蒸馏消除DNN中的后门触发器

【ICLR2021】神经元注意力蒸馏消除DNN中的后门触发器

专知会员服务

15+阅读 · 2021年1月31日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

【伯克利】黑盒机器翻译系统的模仿攻击与防御，Imitation Attacks and Defenses for Black-box Machine Translation Systems

专知会员服务

7+阅读 · 2020年5月4日

【领域对抗学习的低资源文本分类】Low-Resource Text Classification using Domain-Adversarial Learning

【领域对抗学习的低资源文本分类】Low-Resource Text Classification using Domain-Adversarial Learning

专知会员服务

23+阅读 · 2020年4月22日

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

【ACL2020-CMU】预训练模型权重攻击，Weight Poisoning Attacks on PTM

专知会员服务

12+阅读 · 2020年4月16日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

GeoffreyHinton-ICML2020投稿论文-偏转对抗攻击 Deflecting Adversarial Attacks

GeoffreyHinton-ICML2020投稿论文-偏转对抗攻击 Deflecting Adversarial Attacks

专知会员服务

24+阅读 · 2020年2月22日

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

【Facebook AI】对抗性NLI:自然语言理解的新基准，Adversarial NLI: A New Benchmark for Natural Language Understanding

专知会员服务

11+阅读 · 2019年11月2日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《导航战测试床》报告

《用于全球导航卫星系统电子干扰检测与分类的人工智能模型》2025最新107页

《欧洲天空盾牌倡议：应对无人机饱和攻击与高超音速导弹的多层防空演进与挑战》报告

《以人工智能为基准推动现代后勤领域创新和生产力的军事经验》

相关资讯

全球首例！Adversarial T-shirt让你在AI目标检测系统中隐身

全球首例！Adversarial T-shirt让你在AI目标检测系统中隐身

CVer

4+阅读 · 2020年7月7日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

谷歌开发EfficientNets，扩大CNN并与AutoML结合，效率提升10倍|一周AI最火论文

谷歌开发EfficientNets，扩大CNN并与AutoML结合，效率提升10倍|一周AI最火论文

大数据文摘

9+阅读 · 2019年6月4日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

CVPR 2019 | 图像压缩重建也能抵御对抗样本

CVPR 2019 | 图像压缩重建也能抵御对抗样本

计算机视觉life

3+阅读 · 2019年4月26日

计算机视觉的不同任务

计算机视觉的不同任务

专知

5+阅读 · 2018年8月27日

黑魔法防御术：Ian Goodfellow对抗样本研究现状与未来方向综述

黑魔法防御术：Ian Goodfellow对抗样本研究现状与未来方向综述

专知

29+阅读 · 2018年5月26日

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

【论文推荐】最新十篇目标跟踪相关论文—多帧光流跟踪、动态图学习、MV-YOLO、姿态估计、深度核相关滤波、Benchmark

专知

13+阅读 · 2018年5月26日

已删除

将门创投

7+阅读 · 2018年4月25日

gan生成图像at 1024² 的代码论文

gan生成图像at 1024² 的代码论文

CreateAMind

4+阅读 · 2017年10月31日

相关论文

PatchGuard: A Provably Robust Defense against Adversarial Patches via Small Receptive Fields and Masking

Arxiv

0+阅读 · 2021年3月31日

How Robust are Randomized Smoothing based Defenses to Data Poisoning?

Arxiv

0+阅读 · 2021年3月30日

What Causes Optical Flow Networks to be Vulnerable to Physical Adversarial Attacks

Arxiv

0+阅读 · 2021年3月30日

Hidden Backdoor Attack against Semantic Segmentation Models

Arxiv

0+阅读 · 2021年3月30日

Optimal Transport as a Defense Against Adversarial Attacks

Arxiv

0+阅读 · 2021年3月30日

Lagrangian Objective Function Leads to Improved Unforeseen Attack Generalization in Adversarial Training

Arxiv

0+阅读 · 2021年3月29日

Understanding Robustness of Transformers for Image Classification

Arxiv

2+阅读 · 2021年3月26日

Composite Adversarial Attacks

Arxiv

12+阅读 · 2020年12月10日

A Baseline for Few-Shot Image Classification

Arxiv

7+阅读 · 2020年3月1日

DPatch: An Adversarial Patch Attack on Object Detectors

DPatch: An Adversarial Patch Attack on Object Detectors

Arxiv

4+阅读 · 2018年9月15日

微信扫码咨询专知VIP会员