Benchmarking Robustness to Text-Guided Corruptions - 专知 (Zhuanzhi) Papers


April 6, 2023

Benchmarking Robustness to Text-Guided Corruptions


Mohammadreza Mofayezi, Yasamin Medghalchi

This study investigates the robustness of image classifiers to text-guided corruptions. We use diffusion models to edit images into different domains. Unlike prior work that relies on synthetic or hand-picked data for benchmarking, we use diffusion models because they are generative models capable of learning to edit images while preserving their semantic content. As a result, the corruptions are more realistic and the comparison is more informative. In addition, no manual labeling is needed, so large-scale benchmarks can be created with less effort. We define a prompt hierarchy based on the original ImageNet hierarchy to apply edits in different domains. Besides introducing a new benchmark, we investigate the robustness of different vision models. The results show that the performance of image classifiers decreases significantly under different language-based corruptions and edit domains. We also observe that convolutional models are more robust than transformer architectures. Additionally, common data augmentation techniques improve performance on both the original data and the edited images. These findings can help improve the design of image classifiers and contribute to the development of more robust machine learning systems. The code for generating the benchmark will be made available online upon publication.

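The evaluation idea in the abstract (look up edit prompts through a class hierarchy, then compare classifier accuracy on original and edited images) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' code: the superclass names, prompt suffixes, label mapping, and function names are all hypothetical, and the diffusion-based editing step itself is omitted.

```python
from typing import Dict, List, Sequence

# Hypothetical prompt hierarchy: superclass -> edit suffixes.
# These entries are invented for illustration; the paper's actual
# hierarchy is derived from the ImageNet hierarchy and is not shown here.
PROMPT_HIERARCHY: Dict[str, List[str]] = {
    "animal": ["in the snow", "as a pencil sketch"],
    "vehicle": ["at night", "in heavy rain"],
}

# Hypothetical mapping from fine-grained labels to superclasses.
LABEL_TO_SUPERCLASS: Dict[str, str] = {
    "tabby cat": "animal",
    "sports car": "vehicle",
}


def prompts_for(label: str) -> List[str]:
    """Return the edit prompts that apply to a label via its superclass."""
    superclass = LABEL_TO_SUPERCLASS[label]
    return [f"a photo of a {label} {suffix}"
            for suffix in PROMPT_HIERARCHY[superclass]]


def accuracy(predictions: Sequence[int], labels: Sequence[int]) -> float:
    """Fraction of predictions that match the ground-truth labels."""
    if len(predictions) != len(labels) or not labels:
        raise ValueError("predictions and labels must be non-empty and equal length")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def robustness_report(clean_preds: Sequence[int],
                      edited_preds: Sequence[int],
                      labels: Sequence[int]) -> Dict[str, float]:
    """Compare accuracy on original images with accuracy on edited images.

    Labels are reused for the edited images, on the assumption that the
    edit preserves semantic content, so no relabeling is needed.
    """
    clean_acc = accuracy(clean_preds, labels)
    edited_acc = accuracy(edited_preds, labels)
    return {
        "clean_accuracy": clean_acc,
        "edited_accuracy": edited_acc,
        "accuracy_drop": clean_acc - edited_acc,
    }
```

In a full pipeline, `clean_preds` and `edited_preds` would come from running the same classifier on each original image and on its text-edited counterpart; here they are plain lists so the comparison logic can be checked in isolation.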


