Backdoor attacks, in which an adversary uses inputs stamped with a trigger (e.g., a patch) to activate pre-planted malicious behaviors, are a severe threat to Deep Neural Network (DNN) models. Trigger inversion is an effective way of identifying backdoored models and understanding their embedded adversarial behaviors. A key challenge of trigger inversion is that there are many ways to construct a trigger. Existing methods cannot generalize to various trigger types because they make specific assumptions or impose attack-specific constraints. The fundamental reason is that existing work does not consider the trigger's design space when formulating the inversion problem. This work formally defines and analyzes triggers injected in different spaces and the corresponding inversion problem. It then proposes a unified framework to invert backdoor triggers, based on this formalization and on the inner behaviors of backdoored models identified in our analysis. Our prototype UNICORN is general and effective in inverting backdoor triggers in DNNs. The code can be found at https://github.com/RU-System-Software-and-Security/UNICORN.
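To make the inversion problem concrete: the classic pixel-space formulation (in the spirit of Neural Cleanse-style optimization, not the UNICORN framework itself) searches for a mask and a pattern that, when blended into clean inputs, flip the model's prediction to a target class, while an L1 penalty keeps the mask small. The following is a minimal NumPy sketch on a toy linear classifier with a hand-planted backdoor; all names and numbers here are illustrative assumptions.

```python
import numpy as np

# Toy "backdoored" linear classifier: the weight matrix is rigged so that
# a large value in input feature 0 flips the prediction to target class 2.
rng = np.random.default_rng(0)
n_feat, n_cls, target = 4, 3, 2
W = rng.normal(scale=0.3, size=(n_cls, n_feat))
W[target, 0] = 4.0                    # planted backdoor: feature 0 -> class 2
X = rng.normal(size=(8, n_feat))      # a few clean inputs

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def loss_and_grads(m, p, lam=0.05):
    """Cross-entropy toward the target class plus an L1 penalty on the mask."""
    Xt = (1 - m) * X + m * p          # blend trigger pattern p via mask m
    probs = softmax(Xt @ W.T)
    onehot = np.eye(n_cls)[target]
    ce = -np.log(probs[:, target] + 1e-12).mean()
    g = (probs - onehot) @ W / len(X)             # dL/dXt, per sample
    dm = (g * (p - X)).sum(axis=0) + lam * np.sign(m)
    dp = (g * m).sum(axis=0)
    return ce + lam * np.abs(m).sum(), dm, dp

m = np.full(n_feat, 0.5)              # mask init: trigger may touch any feature
p = np.zeros(n_feat)                  # pattern init
loss0, _, _ = loss_and_grads(m, p)
for _ in range(300):                  # plain projected gradient descent
    loss, dm, dp = loss_and_grads(m, p)
    m = np.clip(m - 0.05 * dm, 0.0, 1.0)   # keep mask in [0, 1]
    p = p - 0.05 * dp
loss_final, _, _ = loss_and_grads(m, p)
```

A small recovered mask whose attack loss is low is evidence of a backdoor. The abstract's point is that this single formulation assumes a pixel-space additive trigger; triggers injected in other spaces (e.g., feature or transformation spaces) need the more general formulation the paper develops.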