PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

May 20, 2022

Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang

PaddleSpeech is an open-source all-in-one speech toolkit. It aims at facilitating the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly available at https://github.com/PaddlePaddle/PaddleSpeech.
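To make the "easy-to-use command-line interface" concrete, here is a minimal sketch of how the two task families named above (speech-to-text and text-to-speech) might be invoked from Python. The abstract itself does not show any commands: the `paddlespeech asr` and `paddlespeech tts` subcommands and their `--lang`, `--input`, and `--output` flags are taken from the project README as an assumption and may differ between releases. The helper names (`build_asr_command`, `build_tts_command`, `run_if_installed`) and the file names are hypothetical.

```python
import shutil
import subprocess
from typing import List


def build_asr_command(wav_path: str, lang: str = "zh") -> List[str]:
    """Build the argument list for a speech-to-text call.

    Assumption: the CLI accepts `asr --lang <code> --input <wav>`,
    as shown in the project README; check your installed version.
    """
    return ["paddlespeech", "asr", "--lang", lang, "--input", wav_path]


def build_tts_command(text: str, out_path: str = "output.wav") -> List[str]:
    """Build the argument list for a text-to-speech call.

    Assumption: the CLI accepts `tts --input <text> --output <wav>`.
    """
    return ["paddlespeech", "tts", "--input", text, "--output", out_path]


def run_if_installed(command: List[str]) -> str:
    """Run the command only if its executable is on PATH.

    Returns the captured standard output, or an empty string when the
    executable is not installed, so the sketch stays safe to import.
    """
    if shutil.which(command[0]) is None:
        return ""
    completed = subprocess.run(
        command, capture_output=True, text=True, check=False
    )
    return completed.stdout.strip()


if __name__ == "__main__":
    # "zh.wav" is a placeholder for a 16 kHz mono recording you supply.
    print(build_asr_command("zh.wav"))
    print(build_tts_command("你好"))
```

Passing either argument list to `run_if_installed` would execute the real tool when it is installed; on first use the toolkit is expected to download a pretrained model, so the call may take a while.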
