Despite remarkable improvements, automatic speech recognition (ASR) is susceptible to adversarial perturbations. Compared to attacks on standard machine learning architectures, attacks on ASR systems are significantly more challenging, especially since the input to a speech recognition system is a time series that contains both acoustic and linguistic properties of speech. Extracting all recognition-relevant information requires more complex pipelines and an ensemble of specialized components. Consequently, an attacker needs to consider the entire pipeline. In this paper, we present VENOMAVE, the first training-time poisoning attack against speech recognition. We pursue the same goal as the predominantly studied evasion attacks: leading the system to an incorrect, attacker-chosen transcription of a target audio waveform. In contrast to evasion attacks, however, we assume that the attacker can manipulate only a small part of the training data and cannot alter the target audio waveform at runtime. We evaluate our attack on two datasets: TIDIGITS and Speech Commands. When poisoning less than 0.17% of the dataset, VENOMAVE achieves attack success rates above 80.0% without access to the victim's network architecture or hyperparameters. In a more realistic scenario, when the target audio waveform is played over the air in different rooms, VENOMAVE maintains a success rate of up to 73.3%. Finally, VENOMAVE achieves an attack transferability rate of 36.4% between two different model architectures.