TransFool:对神经机器翻译模型的反向攻击 (TransFool: An Adversarial Attack against Neural Machine Translation Models) - 专知论文

会员服务 ·

0

NMT · MoDELS · Machine Translation · 语义相似度 · 相似度 ·

2023 年 2 月 2 日

TransFool: An Adversarial Attack against Neural Machine Translation Models

翻译：TransFool:对神经机器翻译模型的反向攻击

Sahar Sadrizadeh,Ljiljana Dolamic,Pascal Frossard

Deep neural networks have been shown to be vulnerable to small perturbations of their inputs, known as adversarial attacks. In this paper, we investigate the vulnerability of Neural Machine Translation (NMT) models to adversarial attacks and propose a new attack algorithm called TransFool. To fool NMT models, TransFool builds on a multi-term optimization problem and a gradient projection step. By integrating the embedding representation of a language model, we generate fluent adversarial examples in the source language that maintain a high level of semantic similarity with the clean samples. Experimental results demonstrate that, for different translation tasks and NMT architectures, our white-box attack can severely degrade the translation quality while the semantic similarity between the original and the adversarial sentences stays high. Moreover, we show that TransFool is transferable to unknown target models. Finally, based on automatic and human evaluations, TransFool leads to improvement in terms of success rate, semantic similarity, and fluency compared to the existing attacks both in white-box and black-box settings. Thus, TransFool permits us to better characterize the vulnerability of NMT models and outlines the necessity to design strong defense mechanisms and more robust NMT systems for real-life applications.

翻译：深心神经网络被证明很容易受到其投入(称为对抗性攻击)的小扰动干扰。在本文中,我们调查神经机器翻译(NMT)模型易受对抗性攻击的脆弱性,并提出一个新的攻击算法,称为TransFool。为了愚弄NMT模型,TransFool基于一个多期优化问题和一个梯度投影步骤。通过整合语言模型的嵌入代表,我们在源语言中生成了流畅的对抗性例子,与清洁样本保持高度的语义相似性。实验结果显示,对于不同的翻译任务和NMT结构,我们的白箱攻击可以严重降低翻译质量,而原始和对抗性判决之间的语义相似性则保持高水平。此外,我们证明TransFool可以转换到未知的目标模型。最后,根据自动和人文评价,TransFool导致在成功率、语义相似性和流畅度方面得到改善,与白箱和黑箱环境中的现有攻击相比,两者的语义性相似性都表明,我们的白箱和黑箱结构可以严重降低翻译质量质量,而原始和对抗性相似性相似性相似性相似性相似性相似性相似性相似性相似性相似性相似性相似性也使我们得以更好地确定NMTMTMT型设计系统的脆弱性。

0

相关内容

NMT

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

核因子NF90在肝癌细胞中稳定细胞周期蛋白Cyclin E1 mRNA的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

电压门控钠离子通道Nav1.7通过MACC1调控NHE1促进胃癌增殖的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

内源性二氧化硫通过次磺酸修饰NF-kappa B p65抑制TNF-alpha诱导的脂肪细胞炎症

国家自然科学基金

0+阅读 · 2015年12月31日

PNPLA7蛋白调控肝脏脂肪代谢和非酒精性脂肪肝病的作用和分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

MACC1调控葡萄糖代谢抑制胃癌细胞衰老的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Periostin在前列腺癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

miR-140在肿瘤转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications

Arxiv

0+阅读 · 2023年3月24日

Foiling Explanations in Deep Neural Networks

Arxiv

0+阅读 · 2023年3月24日

Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning Models

Arxiv

0+阅读 · 2023年3月24日

Sibling-Attack: Rethinking Transferable Adversarial Attacks against Face Recognition

Arxiv

0+阅读 · 2023年3月22日

Secure Aggregation in Federated Learning is not Private: Leaking User Data at Large Scale through Model Modification

Arxiv

0+阅读 · 2023年3月21日

Composite Adversarial Attacks

Arxiv

12+阅读 · 2020年12月10日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

A Survey of Adversarial Learning on Graphs

Arxiv

38+阅读 · 2020年3月10日

Adversarial Transfer Learning

Adversarial Transfer Learning

Arxiv

12+阅读 · 2018年12月6日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

VIP会员

文章信息

相关主题

Machine Translation

语义相似度

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

哥伦比亚大学最新《机器学习》课程，Fall-B 2020 (Machine Learning)

专知会员服务

39+阅读 · 2020年11月3日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

【干货书】真实机器学习，264页pdf，Real-World Machine Learning

专知会员服务

115+阅读 · 2020年4月5日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《小型无人机系统侦测追踪技术：声学、计算机视觉与深度学习融合方案》最新98页

《"牧羊人网格"拦截策略：实现无人机集群可靠拦截的新范式》

光纤无人机：反无人机系统的重大挑战

《作战建模与仿真实证研究》

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

LibRec 精选：推荐系统的常用数据集

LibRec 精选：推荐系统的常用数据集

LibRec智能推荐

17+阅读 · 2019年2月15日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

相关论文

Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications

Arxiv

0+阅读 · 2023年3月24日

Foiling Explanations in Deep Neural Networks

Arxiv

0+阅读 · 2023年3月24日

Query Efficient Decision Based Sparse Attacks Against Black-Box Deep Learning Models

Arxiv

0+阅读 · 2023年3月24日

Sibling-Attack: Rethinking Transferable Adversarial Attacks against Face Recognition

Arxiv

0+阅读 · 2023年3月22日

Secure Aggregation in Federated Learning is not Private: Leaking User Data at Large Scale through Model Modification

Arxiv

0+阅读 · 2023年3月21日

Composite Adversarial Attacks

Arxiv

12+阅读 · 2020年12月10日

Attribute-Guided Adversarial Training for Robustness to Natural Perturbations

Arxiv

15+阅读 · 2020年12月3日

A Survey of Adversarial Learning on Graphs

Arxiv

38+阅读 · 2020年3月10日

Adversarial Transfer Learning

Adversarial Transfer Learning

Arxiv

12+阅读 · 2018年12月6日

Event Extraction with Generative Adversarial Imitation Learning

Arxiv

13+阅读 · 2018年4月21日

相关基金

核因子NF90在肝癌细胞中稳定细胞周期蛋白Cyclin E1 mRNA的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

电压门控钠离子通道Nav1.7通过MACC1调控NHE1促进胃癌增殖的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-21/PDCD4/NF-κB通路在血小板抗菌促糖尿病溃疡愈合中的作用及分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

内源性二氧化硫通过次磺酸修饰NF-kappa B p65抑制TNF-alpha诱导的脂肪细胞炎症

国家自然科学基金

0+阅读 · 2015年12月31日

PNPLA7蛋白调控肝脏脂肪代谢和非酒精性脂肪肝病的作用和分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

MACC1调控葡萄糖代谢抑制胃癌细胞衰老的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Periostin在前列腺癌侵袭转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

miR-140在肿瘤转移中的作用及机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

p进表示的伽罗瓦上同调

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员