你们需要的只是互不关联的表述来建立扬声器匿名系统吗? (Are disentangled representations all you need to build speaker anonymization systems?) - 专知论文

会员服务 ·

0

向量化 · 表示 · 语音识别 · 变换 · MoDELS ·

2023 年 1 月 13 日

Are disentangled representations all you need to build speaker anonymization systems?

翻译：你们需要的只是互不关联的表述来建立扬声器匿名系统吗?

Pierre Champion,Denis Jouvet,Anthony Larcher

Speech signals contain a lot of sensitive information, such as the speaker's identity, which raises privacy concerns when speech data get collected. Speaker anonymization aims to transform a speech signal to remove the source speaker's identity while leaving the spoken content unchanged. Current methods perform the transformation by relying on content/speaker disentanglement and voice conversion. Usually, an acoustic model from an automatic speech recognition system extracts the content representation while an x-vector system extracts the speaker representation. Prior work has shown that the extracted features are not perfectly disentangled. This paper tackles how to improve features disentanglement, and thus the converted anonymized speech. We propose enhancing the disentanglement by removing speaker information from the acoustic model using vector quantization. Evaluation done using the VoicePrivacy 2022 toolkit showed that vector quantization helps conceal the original speaker identity while maintaining utility for speech recognition.

翻译：语音信号包含许多敏感信息,例如发言者的身份,这在收集语音数据时引起隐私问题。演讲者匿名的目的是转换语音信号,删除源演讲者的身份,同时不改变发言内容。目前的方法是通过依赖内容/声音脱节和语音转换来进行转换。通常,自动语音识别系统的声学模型会提取内容表述,而x-矢量系统则会提取发言代表。先前的工作显示,所提取的特征没有完全分解。本文将讨论如何改进特征分解,从而转换匿名发言。我们提议通过使用矢量四分法将演讲者信息从音响模型中去除,从而强化混乱状态。使用2022 VoicePrivacy 工具包进行的评估表明,矢量四分解有助于隐藏原发言者的身份,同时保持语音识别的实用性。

0

相关内容

向量化

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

SIPA1在乳腺癌细胞中质核转运机制及其调控基因表达的功能研究

国家自然科学基金

0+阅读 · 2014年12月31日

CSE1L在神经母细胞瘤发展中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

星形胶质细胞RORα在多巴胺神经元损伤中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

ING3：原发性肝癌的诊断与治疗新靶点

国家自然科学基金

0+阅读 · 2012年12月31日

基于SPR效应增强异质结可见光薄膜光催化剂研究

国家自然科学基金

0+阅读 · 2012年12月31日

可见及近红外宽光谱响应的高效固态量子点敏化太阳能电池

国家自然科学基金

0+阅读 · 2012年12月31日

癌/睾丸抗原HCA587对转录因子NF-κB的调节作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

FHL在肝癌发生发展中的功能及其分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

Arxiv

0+阅读 · 2023年3月9日

Paraphrasing Techniques for Maritime QA system

Arxiv

0+阅读 · 2023年3月9日

X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion

Arxiv

0+阅读 · 2023年3月9日

Computing the Difference of Conjunctive Queries Efficiently

Arxiv

0+阅读 · 2023年3月9日

TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

Arxiv

0+阅读 · 2023年3月8日

SumREN: Summarizing Reported Speech about Events in News

Arxiv

0+阅读 · 2023年3月8日

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

Arxiv

0+阅读 · 2023年3月7日

Time-frequency Network for Robust Speaker Recognition

Arxiv

0+阅读 · 2023年3月7日

One for All: Neural Joint Modeling of Entities and Events

Arxiv

11+阅读 · 2018年12月1日

Attention Is All You Need

Arxiv

27+阅读 · 2017年12月6日

VIP会员

文章信息

相关主题

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】低维与高维空间中潜在表征的分析、建模与变换

《生态建模密码破译：建模与编程实践》美陆军最新报告

大模型解决方案白皮书：社交陪伴场景全流程落地指南

面向具身操作的视觉-语言-动作模型综述

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition

Arxiv

0+阅读 · 2023年3月9日

Paraphrasing Techniques for Maritime QA system

Arxiv

0+阅读 · 2023年3月9日

X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion

Arxiv

0+阅读 · 2023年3月9日

Computing the Difference of Conjunctive Queries Efficiently

Arxiv

0+阅读 · 2023年3月9日

TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

Arxiv

0+阅读 · 2023年3月8日

SumREN: Summarizing Reported Speech about Events in News

Arxiv

0+阅读 · 2023年3月8日

Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling

Arxiv

0+阅读 · 2023年3月7日

Time-frequency Network for Robust Speaker Recognition

Arxiv

0+阅读 · 2023年3月7日

One for All: Neural Joint Modeling of Entities and Events

Arxiv

11+阅读 · 2018年12月1日

Attention Is All You Need

Arxiv

27+阅读 · 2017年12月6日

相关基金

SIPA1在乳腺癌细胞中质核转运机制及其调控基因表达的功能研究

国家自然科学基金

0+阅读 · 2014年12月31日

CSE1L在神经母细胞瘤发展中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

星形胶质细胞RORα在多巴胺神经元损伤中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

ING3：原发性肝癌的诊断与治疗新靶点

国家自然科学基金

0+阅读 · 2012年12月31日

基于SPR效应增强异质结可见光薄膜光催化剂研究

国家自然科学基金

0+阅读 · 2012年12月31日

可见及近红外宽光谱响应的高效固态量子点敏化太阳能电池

国家自然科学基金

0+阅读 · 2012年12月31日

癌/睾丸抗原HCA587对转录因子NF-κB的调节作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

FHL在肝癌发生发展中的功能及其分子机制

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员