WOLONet: Wave Outlooker for Efficient and High Fidelity Speech Synthesis


Fidelity · motivation · kernelization · SOTA · speech synthesis

June 20, 2022


Recently, GAN-based neural vocoders such as Parallel WaveGAN, MelGAN, HiFiGAN, and UnivNet have become popular due to their lightweight and parallel structure, resulting in a real-time synthesized waveform with high fidelity, even on a CPU. HiFiGAN and UnivNet are two SOTA vocoders. Despite their high quality, there is still room for improvement. In this paper, motivated by the structure of Vision Outlooker from computer vision, we adopt a similar idea and propose an effective and lightweight neural vocoder called WOLONet. In this network, we develop a novel lightweight block that uses a location-variable, channel-independent, and depthwise dynamic convolutional kernel with sinusoidally activated dynamic kernel weights. To demonstrate the effectiveness and generalizability of our method, we perform an ablation study to verify our novel design and make a subjective and objective comparison with typical GAN-based vocoders. The results show that our WOLONet achieves the best generation quality while requiring fewer parameters than the two neural SOTA vocoders, HiFiGAN and UnivNet.
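The dynamic kernel described in the abstract can be illustrated with a small NumPy sketch. This is not the authors' implementation; it only shows one plausible reading of a "location-variable, channel-independent, depthwise" convolution with sinusoidally activated kernel weights. The function name `dynamic_depthwise_conv`, the linear kernel predictor `proj`, the kernel size, and the zero padding are all assumptions made for illustration.

```python
import numpy as np


def dynamic_depthwise_conv(x, proj, kernel_size=3):
    """Hypothetical sketch of a location-variable dynamic convolution.

    x:    features of shape (channels, time)
    proj: assumed linear kernel predictor of shape (kernel_size, channels)
    """
    channels, time = x.shape
    assert kernel_size % 2 == 1, "odd kernel size keeps the output length equal to time"
    assert proj.shape == (kernel_size, channels)

    # Location-variable: a separate kernel is predicted for every time step
    # from the features at that step. The sine is the (assumed) activation
    # applied to the predicted kernel weights.
    kernels = np.sin(proj @ x)  # (kernel_size, time)

    # Zero-pad the time axis so every step has a full window around it.
    pad = kernel_size // 2
    x_pad = np.pad(x, ((0, 0), (pad, pad)))

    # windows[c, t, k] == x_pad[c, t + k]
    windows = np.lib.stride_tricks.sliding_window_view(x_pad, kernel_size, axis=1)

    # Channel-independent and depthwise (as assumed here): the same kernel is
    # used for all channels at a given step, and channels are never mixed.
    return np.einsum("ctk,kt->ct", windows, kernels)


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 10))     # 4 channels, 10 time steps
proj = rng.standard_normal((3, 4))   # predicts a 3-tap kernel per time step
y = dynamic_depthwise_conv(x, proj, kernel_size=3)
print(y.shape)  # (4, 10)
```

Under these assumptions the block stores only `kernel_size * channels` weights for the predictor, which fits the abstract's emphasis on a lightweight design. A static depthwise convolution would instead apply one fixed kernel at every time step.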

