TALCS: 开放源码普通普通话-英语代码转换体和语音识别基线 (TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline) - 专知论文

会员服务 ·

0

语音识别 · 基准 · 知识 (knowledge) · Performer · 自动语音识别 ·

2022 年 6 月 27 日

TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline

翻译：TALCS: 开放源码普通普通话-英语代码转换体和语音识别基线

Chengfei Li,Shuhao Deng,Yaoping Wang,Guangjing Wang,Yaguang Gong,Changbin Chen,Jinfeng Bai

from arxiv, accepted by INTERSPEECH 2022

This paper introduces a new corpus of Mandarin-English code-switching speech recognition--TALCS corpus, suitable for training and evaluating code-switching speech recognition systems. TALCS corpus is derived from real online one-to-one English teaching scenes in TAL education group, which contains roughly 587 hours of speech sampled at 16 kHz. To our best knowledge, TALCS corpus is the largest well labeled Mandarin-English code-switching open source automatic speech recognition (ASR) dataset in the world. In this paper, we will introduce the recording procedure in detail, including audio capturing devices and corpus environments. And the TALCS corpus is freely available for download under the permissive license1. Using TALCS corpus, we conduct ASR experiments in two popular speech recognition toolkits to make a baseline system, including ESPnet and Wenet. The Mixture Error Rate (MER) performance in the two speech recognition toolkits is compared in TALCS corpus. The experimental results implies that the quality of audio recordings and transcriptions are promising and the baseline system is workable.

翻译：本文介绍了一套新的普通话-英语密码转换语音识别-TALCS系统,适合培训和评价密码转换语音识别系统;TALCS系统来自TAL教育组实际的在线一对一英语教学场景,该组约有587小时的语音抽样,在16千赫兹16千赫兹。据我们所知,TALCS系统是全世界最大的有良好标签的普通话-英语密码转换开源自动语音识别数据集;在本文中,我们将引入详细记录程序,包括音频捕获装置和物质环境;TALCS系统可免费下载许可许可证1。我们利用TALCS系统,在两个流行语音识别工具包中进行ASR实验,以建立基线系统,包括ESPnet和Wenet。两个语音识别工具包中的混结错误率(MERS)表现在TALCSPS系统中进行了比较。实验结果表明,录音记录和抄录的质量很有希望,基线系统是可行的。

0

相关内容

语音识别

语音识别是计算机科学和计算语言学的一个跨学科子领域，它发展了一些方法和技术，使计算机可以将口语识别和翻译成文本。它也被称为自动语音识别（ASR），计算机语音识别或语音转文本（STT）。它整合了计算机科学，语言学和计算机工程领域的知识和研究。

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

专知

26+阅读 · 2018年5月22日

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

专知

37+阅读 · 2018年2月21日

γ-Synuclein调控MAPK-ERK-JNK信号通路及细胞周期促进子宫内膜癌恶性进展的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

人可溶性鸟苷酸环化酶介导一氧化氮信号转导的结构基础和调控分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

叶绿体类核区蛋白PUC1对PEP类型基因表达的分子调节机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

马尾松高抗旱家系应答干旱胁迫的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

拟南芥叶绿体分裂相关基因AtGsc1的克隆及功能分析

国家自然科学基金

0+阅读 · 2011年12月31日

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition

Arxiv

0+阅读 · 2022年8月16日

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition

Arxiv

0+阅读 · 2022年8月15日

A Domain-Specific Language for Simulation-Based Testing of IoT Edge-to-Cloud Solutions

Arxiv

0+阅读 · 2022年8月15日

Compositional Synthesis of Modular Systems (Full Version)

Arxiv

0+阅读 · 2022年8月12日

Incorporating Dictionaries into Deep Neural Networks for the Chinese Clinical Named Entity Recognition

Arxiv

12+阅读 · 2018年4月13日

VIP会员

文章信息

相关主题

知识 (knowledge)

自动语音识别

相关VIP内容

自然语言处理顶会NAACL2022最佳论文出炉！

自然语言处理顶会NAACL2022最佳论文出炉！

专知会员服务

43+阅读 · 2022年6月30日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

【论文推荐】最新五篇命名实体识别相关论文—深度主动学习、Lattice LSTM、混合马尔可夫CRF

专知

26+阅读 · 2018年5月22日

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

【论文推荐】最新五篇命名实体识别（NER）相关论文—对抗学习、语料库、深度多任务学习、先验知识、跨语言语义

专知

37+阅读 · 2018年2月21日

相关论文

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition

Arxiv

0+阅读 · 2022年8月16日

Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition

Arxiv

0+阅读 · 2022年8月15日

A Domain-Specific Language for Simulation-Based Testing of IoT Edge-to-Cloud Solutions

Arxiv

0+阅读 · 2022年8月15日

Compositional Synthesis of Modular Systems (Full Version)

Arxiv

0+阅读 · 2022年8月12日

Incorporating Dictionaries into Deep Neural Networks for the Chinese Clinical Named Entity Recognition

Arxiv

12+阅读 · 2018年4月13日

相关基金

γ-Synuclein调控MAPK-ERK-JNK信号通路及细胞周期促进子宫内膜癌恶性进展的机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

人可溶性鸟苷酸环化酶介导一氧化氮信号转导的结构基础和调控分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

叶绿体类核区蛋白PUC1对PEP类型基因表达的分子调节机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

马尾松高抗旱家系应答干旱胁迫的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

拟南芥叶绿体分裂相关基因AtGsc1的克隆及功能分析

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员