Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline - Zhuanzhi Papers

March 24, 2023

Hiding speaker's sex in speech using zero-evidence speaker representation in an analysis/synthesis pipeline

Paul-Gauthier Noé, Xiaoxiao Miao, Xin Wang, Junichi Yamagishi, Jean-François Bonastre, Driss Matrouf

From arXiv. Accepted to ICASSP 2023.

The use of modern vocoders in an analysis/synthesis pipeline allows us to investigate high-quality voice conversion that can be used for privacy purposes. Here, we propose to transform the speaker embedding and the pitch in order to hide the sex of the speaker. ECAPA-TDNN-based speaker representation fed into a HiFiGAN vocoder is protected using a neural-discriminant analysis approach, which is consistent with the zero-evidence concept of privacy. This approach significantly reduces the information in speech related to the speaker's sex while preserving speech content and some consistency in the resulting protected voices.
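
The abstract does not spell out the transformations, so the following is only a toy sketch of the two ideas it names, not the authors' implementation. It assumes a hypothetical one-dimensional discriminant coordinate `z` with class-conditional Gaussians N(+mu, std^2) and N(-mu, std^2), for which the log-likelihood ratio (LLR) between the two classes is zero at `z = 0`; "zero evidence" is read here as "LLR equal to zero". It also assumes pitch is handled by shifting the mean log-F0 of voiced frames to a common target. All function names, parameters, and values are illustrative assumptions.

```python
import math


def gaussian_logpdf(x, mean, std):
    """Log density of a univariate Gaussian."""
    return -0.5 * math.log(2 * math.pi * std ** 2) - (x - mean) ** 2 / (2 * std ** 2)


def sex_llr(z, mu=1.0, std=1.0):
    """LLR log p(z | A) - log p(z | B) for the assumed toy model.

    Class A is N(+mu, std^2), class B is N(-mu, std^2) (assumption).
    Analytically this equals 2 * mu * z / std**2.
    """
    return gaussian_logpdf(z, mu, std) - gaussian_logpdf(z, -mu, std)


def protect(z):
    """Toy protection: replace the discriminant coordinate by the
    point where the LLR is zero (z = 0 in this symmetric model)."""
    return 0.0


def shift_log_f0(f0_hz, target_mean_log_f0):
    """Shift voiced F0 values so their mean log-F0 equals the target.

    A value of 0 marks an unvoiced frame and is left unchanged.
    """
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        return list(f0_hz)
    mean_log = sum(math.log(f) for f in voiced) / len(voiced)
    scale = math.exp(target_mean_log_f0 - mean_log)
    return [f * scale if f > 0 else 0.0 for f in f0_hz]


if __name__ == "__main__":
    z = 0.8                                   # hypothetical coordinate
    print(sex_llr(z))                         # non-zero evidence before protection
    print(sex_llr(protect(z)))                # zero evidence after protection
    print(shift_log_f0([100.0, 0.0, 200.0], math.log(150.0)))
```

In this sketch the protected coordinate carries no evidence for either class under the assumed model, while the pitch shift keeps the contour shape and only moves its average level.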
