Swebender GAN: 具有超音效果的语音操控结构 (Wavebender GAN: An architecture for phonetically meaningful speech manipulation) - 专知论文

会员服务 ·

0

控制器 · 学成 · GAN · 模型评估 · 值域 ·

2022 年 2 月 22 日

Wavebender GAN: An architecture for phonetically meaningful speech manipulation

翻译：Swebender GAN: 具有超音效果的语音操控结构

Gustavo Teodoro Döhler Beck,Ulme Wennberg,Zofia Malisz,Gustav Eje Henter

from arxiv, 5 pages, 4 figures; to appear at ICASSP 2022

Deep learning has revolutionised synthetic speech quality. However, it has thus far delivered little value to the speech science community. The new methods do not meet the controllability demands that practitioners in this area require e.g.: in listening tests with manipulated speech stimuli. Instead, control of different speech properties in such stimuli is achieved by using legacy signal-processing methods. This limits the range, accuracy, and speech quality of the manipulations. Also, audible artefacts have a negative impact on the methodological validity of results in speech perception studies. This work introduces a system capable of manipulating speech properties through learning rather than design. The architecture learns to control arbitrary speech properties and leverages progress in neural vocoders to obtain realistic output. Experiments with copy synthesis and manipulation of a small set of core speech features (pitch, formants, and voice quality measures) illustrate the promise of the approach for producing speech stimuli that have accurate control and high perceptual quality.

翻译：深层学习使合成语言质量发生了革命性的变化。然而,迄今为止,它几乎没有给语言科学界带来什么价值。新方法没有满足该领域实践者所需要的可控制性要求,例如:用操纵的语音刺激进行听觉测试。相反,通过使用遗留的信号处理方法控制了这种刺激中的不同语音属性。这限制了操纵的广度、准确性和语言质量。此外,听力工艺对语音认知研究结果的方法有效性产生了负面影响。这项工作引入了一个能够通过学习而不是设计来操纵语言特性的系统。建筑学会控制任意的语音特性并利用神经电动器的进展获得现实的输出。在复制合成和操纵一小组核心语音特征(脉动、成型和声音质量措施)方面的实验显示了制作具有准确控制和高感官质量的语音模拟功能的前景。

0

相关内容

控制器

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

基于声发射信号特征的高速焊凝固热裂纹在线检测方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向成像差异的高精度强适应SAR景象匹配算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于变分结构纹理分解的超分辨率图像复原方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于聚合物悬挂波导的可见光波段集成型传感器的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

芍药切花ACS和ETR1基因的克隆、时空表达及功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

滇西老厂富银红土型锰矿次生富集机制及40Ar/39Ar年龄

国家自然科学基金

0+阅读 · 2012年12月31日

嵌段共聚物多级自组装模拟分子伴侣的结构与功能

国家自然科学基金

1+阅读 · 2011年12月31日

关系的分解与Domain的表示

国家自然科学基金

1+阅读 · 2011年12月31日

烟草内生菌多样性研究

国家自然科学基金

0+阅读 · 2008年12月31日

Composite Anomaly Detection via Hierarchical Dynamic Search

Arxiv

0+阅读 · 2022年4月20日

Latent Space Smoothing for Individually Fair Representations

Arxiv

0+阅读 · 2022年4月19日

I still have Time(s): Extending HeidelTime for German Texts

Arxiv

0+阅读 · 2022年4月19日

Detect-and-describe: Joint learning framework for detection and description of objects

Arxiv

0+阅读 · 2022年4月19日

"Flux+Mutability": A Conditional Generative Approach to One-Class Classification and Anomaly Detection

Arxiv

0+阅读 · 2022年4月19日

BLEWhisperer: Exploiting BLE Advertisements for Data Exfiltration

Arxiv

0+阅读 · 2022年4月17日

Efficient Spatial Representation and Routing of Deformable One-Dimensional Objects for Manipulation

Arxiv

0+阅读 · 2022年4月16日

Detecting Violence in Video Based on Deep Features Fusion Technique

Detecting Violence in Video Based on Deep Features Fusion Technique

Arxiv

0+阅读 · 2022年4月15日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Automatically Designing CNN Architectures for Medical Image Segmentation

Automatically Designing CNN Architectures for Medical Image Segmentation

Arxiv

10+阅读 · 2018年7月19日

VIP会员

文章信息

相关主题

相关VIP内容

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《分析与预测陆军战斗体能测试表现：统计与机器学习方法》2025最新137页

《军事行动中的人机协同共同学习》2025最新文献

代理式人工智能时代的决策优势

《F/A-18机队替换中队仿真模型的设计与分析》2025最新73页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

相关论文

Composite Anomaly Detection via Hierarchical Dynamic Search

Arxiv

0+阅读 · 2022年4月20日

Latent Space Smoothing for Individually Fair Representations

Arxiv

0+阅读 · 2022年4月19日

I still have Time(s): Extending HeidelTime for German Texts

Arxiv

0+阅读 · 2022年4月19日

Detect-and-describe: Joint learning framework for detection and description of objects

Arxiv

0+阅读 · 2022年4月19日

"Flux+Mutability": A Conditional Generative Approach to One-Class Classification and Anomaly Detection

Arxiv

0+阅读 · 2022年4月19日

BLEWhisperer: Exploiting BLE Advertisements for Data Exfiltration

Arxiv

0+阅读 · 2022年4月17日

Efficient Spatial Representation and Routing of Deformable One-Dimensional Objects for Manipulation

Arxiv

0+阅读 · 2022年4月16日

Detecting Violence in Video Based on Deep Features Fusion Technique

Detecting Violence in Video Based on Deep Features Fusion Technique

Arxiv

0+阅读 · 2022年4月15日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Automatically Designing CNN Architectures for Medical Image Segmentation

Automatically Designing CNN Architectures for Medical Image Segmentation

Arxiv

10+阅读 · 2018年7月19日

相关基金

基于声发射信号特征的高速焊凝固热裂纹在线检测方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

面向成像差异的高精度强适应SAR景象匹配算法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于变分结构纹理分解的超分辨率图像复原方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于聚合物悬挂波导的可见光波段集成型传感器的研究

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

芍药切花ACS和ETR1基因的克隆、时空表达及功能分析

国家自然科学基金

0+阅读 · 2012年12月31日

滇西老厂富银红土型锰矿次生富集机制及40Ar/39Ar年龄

国家自然科学基金

0+阅读 · 2012年12月31日

嵌段共聚物多级自组装模拟分子伴侣的结构与功能

国家自然科学基金

1+阅读 · 2011年12月31日

关系的分解与Domain的表示

国家自然科学基金

1+阅读 · 2011年12月31日

烟草内生菌多样性研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员