It is well-known that recurrent neural networks (RNNs), although widely used, are vulnerable to adversarial attacks, including one-frame attacks and multi-frame attacks. Though a few certified defenses exist to provide guaranteed robustness against one-frame attacks, we prove that defending against multi-frame attacks remains a challenging problem due to their enormous perturbation space. In this paper, we propose RNN-Guard, the first certified defense against multi-frame attacks for RNNs. To address the above challenge, we adopt the perturb-all-frame strategy to construct perturbation spaces consistent with those in multi-frame attacks. However, the perturb-all-frame strategy causes a precision issue in linear relaxations. To address this issue, we introduce a novel abstract domain called InterZono and design tighter relaxations. We prove that InterZono is more precise than Zonotope yet carries the same time complexity. Experimental evaluations across various datasets and model structures show that the certified robust accuracy computed by RNN-Guard with InterZono is up to 2.18 times higher than that with Zonotope. In addition, we extend RNN-Guard into the first certified training method against multi-frame attacks to directly enhance RNNs' robustness. The results show that the certified robust accuracy of models trained with RNN-Guard against multi-frame attacks is 15.47 to 67.65 percentage points higher than that of models trained with other methods.
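To make the gap between the two threat models concrete, the perturbation sets can be sketched as follows. The notation is illustrative rather than taken from the paper and assumes an $\ell_\infty$ budget $\epsilon$ on a $T$-frame input $x = (x_1, \dots, x_T)$. A one-frame attack may modify only a single frame,
\[
\mathcal{B}_{\mathrm{one}}(x) = \{\, x' \mid \exists\, t \in \{1,\dots,T\} : \|x'_t - x_t\|_\infty \le \epsilon \ \text{and}\ x'_s = x_s \ \text{for all } s \ne t \,\},
\]
whereas a multi-frame attack may perturb every frame simultaneously,
\[
\mathcal{B}_{\mathrm{all}}(x) = \{\, x' \mid \|x'_t - x_t\|_\infty \le \epsilon \ \text{for all } t \,\}.
\]
Since $\mathcal{B}_{\mathrm{one}}(x) \subset \mathcal{B}_{\mathrm{all}}(x)$ and the all-frame set grows with every additional frame, a certified defense must propagate bounds over a much larger region; this is what the perturb-all-frame strategy covers and what makes precise linear relaxations harder to maintain.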