STRATA: Simple, Gradient-Free Attacks for Models of Code - Zhuanzhi Papers

August 19, 2021

STRATA: Simple, Gradient-Free Attacks for Models of Code

Jacob M. Springer, Bryn Marie Reinstadler, Una-May O'Reilly

From arXiv; KDD'21 AdvML Workshop

Neural networks are well known to be vulnerable to imperceptible perturbations in the input, called adversarial examples, that result in misclassification. Generating adversarial examples for source code poses an additional challenge compared to the domains of images and natural language, because source code perturbations must retain the functional meaning of the code. We identify a striking relationship between token frequency statistics and learned token embeddings: the L2 norm of learned token embeddings increases with the frequency of the token except for the highest-frequency tokens. We leverage this relationship to construct a simple and efficient gradient-free method for generating state-of-the-art adversarial examples on models of code. Our method empirically outperforms competing gradient-based methods with less information and less computational effort.
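
The idea in the abstract can be illustrated with a small Python sketch. It is not the authors' implementation: the abstract does not spell out the attack procedure, so the choice of identifier renaming as the meaning-preserving perturbation, the helper names (`rank_tokens_by_norm`, `frequency_proxy_candidates`, `rename_identifier`), the `skip_top` cutoff, and the toy data are all assumptions made for illustration. The sketch shows two ways to pick replacement tokens: ranking by embedding L2 norm when embeddings are available, and using token frequency as a gradient-free proxy when they are not.

```python
import math
import re
from collections import Counter


def l2_norm(vector):
    """Euclidean (L2) norm of one embedding vector."""
    return math.sqrt(sum(x * x for x in vector))


def rank_tokens_by_norm(embeddings):
    """Return tokens sorted from largest to smallest embedding L2 norm."""
    return sorted(embeddings, key=lambda tok: l2_norm(embeddings[tok]), reverse=True)


def frequency_proxy_candidates(token_counts, skip_top=1, k=3):
    """Pick k frequent tokens, skipping the `skip_top` most frequent ones.

    Assumption: because the abstract reports that norm grows with frequency
    except for the highest-frequency tokens, frequent-but-not-top tokens are
    used here as a stand-in for high-norm tokens. The cutoff is hypothetical.
    """
    ranked = [tok for tok, _ in token_counts.most_common()]
    return ranked[skip_top:skip_top + k]


def rename_identifier(source, old_name, new_name):
    """Rename whole-word occurrences of an identifier.

    Naive regex-based renaming: it would also touch strings and comments,
    and it does not check for name collisions, so a real tool would need a
    parser to guarantee the program's behavior is unchanged.
    """
    return re.sub(rf"\b{re.escape(old_name)}\b", new_name, source)


if __name__ == "__main__":
    # Toy embeddings and counts, invented for this example.
    toy_embeddings = {"value": [3.0, 4.0], "tmp": [0.3, 0.4], "index": [1.0, 2.0]}
    toy_counts = Counter({"self": 100, "value": 40, "index": 25, "tmp": 2})

    print(rank_tokens_by_norm(toy_embeddings))
    candidates = frequency_proxy_candidates(toy_counts, skip_top=1, k=2)
    print(candidates)

    original = "def f(n):\n    return n + 1\n"
    print(rename_identifier(original, "n", candidates[0]))
```

In an actual attack loop, each candidate rename would be submitted to the target model and kept only if the prediction changes; that step is omitted here because it depends on the model under test.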
