Neural ranking models (NRMs) have shown remarkable success in recent years, especially with pre-trained language models. However, deep neural models are notorious for their vulnerability to adversarial examples. Adversarial attacks may become a new type of web spamming technique given our increased reliance on neural information retrieval models. Therefore, it is important to study potential adversarial attacks to identify vulnerabilities of NRMs before they are deployed. In this paper, we introduce the Word Substitution Ranking Attack (WSRA) task against NRMs, which aims to promote a target document in rankings by adding adversarial perturbations to its text. We focus on the decision-based black-box attack setting, where the attackers have no access to the model parameters and gradients, but can only acquire the rank positions of a partially retrieved list by querying the target model. This attack setting is realistic for real-world search engines. We propose a novel Pseudo Relevance-based ADversarial ranking Attack method (PRADA) that learns a surrogate model based on Pseudo Relevance Feedback (PRF) to generate gradients for finding the adversarial perturbations. Experiments on two web search benchmark datasets show that PRADA can outperform existing attack strategies and successfully fool the NRM with small, indiscernible perturbations of text.