Generative adversarial networks (GANs) have demonstrated their superiority in real-time speech synthesis. Nevertheless, most GAN vocoders use deep convolutional layers as their backbone, which may discard information from preceding signal samples. The generation of a speech signal, however, invariably requires preceding waveform samples for its reconstruction, and the lack of this information can lead to artifacts in the generated speech. To address this conflict, we propose an improved model: a post auto-regressive (AR) GAN vocoder with a self-attention layer, which merges self-attention into an AR loop. This loop does not participate in inference, but during training it helps the generator learn temporal dependencies within frames. Furthermore, an ablation study was conducted to confirm the contribution of each component. Systematic experiments show that our model yields consistent improvements in both objective and subjective evaluations.
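To make the idea of a training-only self-attention branch inside an auto-regressive loop concrete, the following is a minimal numpy sketch. All names (`self_attention`, `generate`, the frame shapes) are hypothetical illustrations, not the paper's actual architecture; it only shows the control flow in which causally masked self-attention over previously generated frames is applied during training and skipped at inference.

```python
import numpy as np

def self_attention(x):
    """Single-head scaled dot-product self-attention over frames.

    x: array of shape (T, d) -- a sequence of T frame vectors.
    A causal mask ensures each frame attends only to itself and
    earlier frames, matching the auto-regressive setting.
    """
    d = x.shape[-1]
    scores = x @ x.T / np.sqrt(d)
    # mask out future positions (upper triangle above the diagonal)
    mask = np.triu(np.ones_like(scores), k=1)
    scores = np.where(mask == 1, -1e9, scores)
    # numerically stable softmax over the last axis
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ x

def generate(frames, training):
    """Hypothetical AR loop: each step conditions on prior outputs.

    When training is True, the self-attention branch mixes temporal
    context into the current frame; at inference it is bypassed, so
    the attention layer adds no cost to real-time synthesis.
    """
    out = []
    for t in range(len(frames)):
        ctx = np.stack(out + [frames[t]])
        if training:
            ctx = self_attention(ctx)  # training-only branch
        out.append(ctx[-1])
    return np.stack(out)
```

Because the attention branch is bypassed at inference, `generate(x, training=False)` reduces to an identity pass in this toy sketch, illustrating why the extra layer incurs no runtime cost in deployment.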