Vision Transformers (ViTs) have demonstrated state-of-the-art performance in various vision-related tasks. The success of ViTs motivates adversaries to perform backdoor attacks on them. Although the vulnerability of traditional CNNs to backdoor attacks is well known, backdoor attacks on ViTs are seldom studied. Compared to CNNs, which capture pixel-wise local features by convolutions, ViTs extract global context information through patches and attention. Na\"ively transplanting CNN-specific backdoor attacks to ViTs yields only low clean data accuracy and a low attack success rate. In this paper, we propose a stealthy and practical ViT-specific backdoor attack, $TrojViT$. Rather than the area-wise trigger used by CNN-specific backdoor attacks, TrojViT generates a patch-wise trigger, via patch salience ranking and an attention-target loss, designed to build a Trojan consisting of a small set of vulnerable bits in the parameters of a ViT stored in DRAM. TrojViT further uses a minimum-tuned parameter update to reduce the number of bits in the Trojan. Once the attacker inserts the Trojan into the ViT model by flipping these vulnerable bits, the model still produces normal inference accuracy on benign inputs, but when the attacker embeds the trigger into an input, the model is forced to classify that input to a predefined target class. We show that flipping only a few vulnerable bits identified by TrojViT, using the well-known RowHammer attack, can transform a ViT model into a backdoored one. We perform extensive experiments on multiple datasets with various ViT models. TrojViT classifies $99.64\%$ of test images to a target class by flipping $345$ bits of a ViT trained on ImageNet.
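To make the patch salience ranking step concrete, the following is a minimal PyTorch sketch: it scores every input patch by the gradient magnitude of a target-class loss and returns the most salient patches as candidate trigger locations. The function name, the 224x224/16x16 geometry, and the use of plain cross-entropy are illustrative assumptions, not the paper's exact procedure (which optimizes an attention-target loss).

```python
# A minimal sketch of patch salience ranking, assuming a torch ViT classifier
# over 224x224 images with 16x16 patches; names and loss are illustrative.
import torch
import torch.nn.functional as F

def rank_patch_salience(model, image, target_class, patch_size=16, num_patches=4):
    """Score each input patch by the gradient magnitude of the target-class
    loss, then return the indices of the most salient patches."""
    image = image.clone().requires_grad_(True)           # (1, 3, 224, 224)
    loss = F.cross_entropy(model(image), torch.tensor([target_class]))
    loss.backward()
    grad = image.grad.abs().sum(dim=1)                   # (1, 224, 224), summed over channels
    # Aggregate |gradient| inside each non-overlapping patch_size x patch_size patch.
    scores = (grad.unfold(1, patch_size, patch_size)
                  .unfold(2, patch_size, patch_size)
                  .sum(dim=(-1, -2))
                  .flatten())                            # one score per patch
    return scores.topk(num_patches).indices              # most salient patch indices
```

In the full attack, the trigger pixels inside the selected patches would then be optimized against the attention-target loss, and the Trojan's vulnerable bits identified from the resulting parameter updates.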