2022年的语音空间系统说明:议长匿名,配有与特效相配的F0轨 (VoicePrivacy 2022 System Description: Speaker Anonymization with Feature-matched F0 Trajectories) - 专知论文

会员服务 ·

0

INFORMS · 估计/估计量 · Performer · Better · Networking ·

2022 年 10 月 31 日

VoicePrivacy 2022 System Description: Speaker Anonymization with Feature-matched F0 Trajectories

翻译：2022年的语音空间系统说明:议长匿名,配有与特效相配的F0轨

Ünal Ege Gaznepoglu,Anna Leschanowsky,Nils Peters

from arxiv, 4 pages, 4 figures, 2 tables, submitted to VoicePrivacy Challenge 2022

We introduce a novel method to improve the performance of the VoicePrivacy Challenge 2022 baseline B1 variants. Among the known deficiencies of x-vector-based anonymization systems is the insufficient disentangling of the input features. In particular, the fundamental frequency (F0) trajectories, which are used for voice synthesis without any modifications. Especially in cross-gender conversion, this situation causes unnatural sounding voices, increases word error rates (WERs), and personal information leakage. Our submission overcomes this problem by synthesizing an F0 trajectory, which better harmonizes with the anonymized x-vector. We utilized a low-complexity deep neural network to estimate an appropriate F0 value per frame, using the linguistic content from the bottleneck features (BN) and the anonymized x-vector. Our approach results in a significantly improved anonymization system and increased naturalness of the synthesized voice. Consequently, our results suggest that F0 extraction is not required for voice anonymization.

翻译：我们引入了一种创新方法来改进2022年语音探索挑战基线B1变体的性能。在已知的基于x矢量的匿名系统缺陷中,未充分分解输入特征。特别是用于语音合成而没有任何修改的基本频率(F0)轨迹。特别是在跨性别转换方面,这种情况导致非自然声音的探测、增加单词错误率和个人信息泄漏。我们的呈文通过合成一个F0轨迹克服了这一问题,F0轨迹与匿名化x-矢量系统更加一致。我们利用一个低兼容深度神经网络来估计每个框架的适当F0值,使用来自瓶颈特征(BN)和匿名化x-Victor的语言内容。我们的方法导致一个显著改进的匿名系统以及合成声音的自然性增强。因此,我们的呈文结果表明,声音匿名不需要F0提取。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

专知

26+阅读 · 2018年5月22日

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

miR-146a靶向IRAK1与TRAF6调控非小细胞肺癌转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

γ#27688;基丁酸能与谷氨酸能神经系统调节失衡与抑郁症

国家自然科学基金

0+阅读 · 2009年12月31日

基于遥感的棉花长势监测模型及其栽培应用

国家自然科学基金

0+阅读 · 2008年12月31日

Predicting Ejection Fraction from Chest X-rays Using Computer Vision for Diagnosing Heart Failure

Arxiv

0+阅读 · 2022年12月19日

Less is More: Parameter-Free Text Classification with Gzip

Arxiv

0+阅读 · 2022年12月19日

Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Arxiv

0+阅读 · 2022年12月18日

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth

Arxiv

0+阅读 · 2022年12月16日

SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory

Arxiv

0+阅读 · 2022年12月15日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

专知

26+阅读 · 2018年5月22日

相关论文

Predicting Ejection Fraction from Chest X-rays Using Computer Vision for Diagnosing Heart Failure

Arxiv

0+阅读 · 2022年12月19日

Less is More: Parameter-Free Text Classification with Gzip

Arxiv

0+阅读 · 2022年12月19日

Video Reconstruction from a Single Motion Blurred Image using Learned Dynamic Phase Coding

Arxiv

0+阅读 · 2022年12月18日

MVSFormer: Multi-View Stereo by Learning Robust Image Features and Temperature-based Depth

Arxiv

0+阅读 · 2022年12月16日

SteerNeRF: Accelerating NeRF Rendering via Smooth Viewpoint Trajectory

Arxiv

0+阅读 · 2022年12月15日

相关基金

Schr？dinger-Poisson方程守恒DDG方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

miR-146a靶向IRAK1与TRAF6调控非小细胞肺癌转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

γ#27688;基丁酸能与谷氨酸能神经系统调节失衡与抑郁症

国家自然科学基金

0+阅读 · 2009年12月31日

基于遥感的棉花长势监测模型及其栽培应用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员