Written language contains stylistic cues that can be exploited to automatically infer a variety of potentially sensitive author information. Adversarial stylometry aims to attack such models by rewriting an author's text. Our research proposes several components to facilitate deployment of these adversarial attacks in the wild, where neither data nor target models are accessible. We introduce a transformer-based extension of a lexical replacement attack and show that it achieves high transferability when trained on a weakly labeled corpus, decreasing target model performance below chance. While not completely inconspicuous, our more successful attacks also prove notably less detectable by humans. Our framework therefore provides a promising direction for future privacy-preserving adversarial attacks.
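To make the attack concrete, the sketch below shows one plausible form a transformer-based lexical replacement attack could take: masking a word and letting a pretrained masked language model propose a substitute. The model choice (`bert-base-uncased`), the `replace_word` helper, and the greedy one-word-at-a-time strategy are illustrative assumptions, not the paper's exact method.

```python
# Minimal sketch of a masked-LM lexical replacement attack,
# assuming the Hugging Face `transformers` library is installed.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

def replace_word(text: str, target: str, top_k: int = 5) -> str:
    """Mask one occurrence of `target` and substitute the highest-scoring
    masked-LM candidate that differs from the original word."""
    mask_token = fill_mask.tokenizer.mask_token
    masked = text.replace(target, mask_token, 1)
    for candidate in fill_mask(masked, top_k=top_k):
        token = candidate["token_str"].strip()
        if token.lower() != target.lower():
            # Substituting a fluent alternative perturbs stylistic cues
            # while keeping the sentence grammatical.
            return masked.replace(mask_token, token, 1)
    return text  # keep the original if no distinct substitute is found

print(replace_word("The weather was awfully dreary that morning.", "dreary"))
```

In a full attack, such replacements would be applied iteratively across the document, ideally guided by a surrogate attribution model, since the abstract stipulates that the target model itself is inaccessible.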