Makerrere电台演讲公司:一个自动语音识别的Luganda电台公司 (The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition) - 专知论文

会员服务 ·

0

自动语音识别 · 语音识别 · 知识 (knowledge) · Performer · 可理解性 ·

2022 年 6 月 20 日

The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition

翻译：Makerrere电台演讲公司:一个自动语音识别的Luganda电台公司

Jonathan Mukiibi,Andrew Katumba,Joyce Nakatumba-Nabende,Ali Hussein,Josh Meyer

from arxiv, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022), pages 1945 to 1954 Marseille, 20 to 25 June 2022

Building a usable radio monitoring automatic speech recognition (ASR) system is a challenging task for under-resourced languages and yet this is paramount in societies where radio is the main medium of public communication and discussions. Initial efforts by the United Nations in Uganda have proved how understanding the perceptions of rural people who are excluded from social media is important in national planning. However, these efforts are being challenged by the absence of transcribed speech datasets. In this paper, The Makerere Artificial Intelligence research lab releases a Luganda radio speech corpus of 155 hours. To our knowledge, this is the first publicly available radio dataset in sub-Saharan Africa. The paper describes the development of the voice corpus and presents baseline Luganda ASR performance results using Coqui STT toolkit, an open source speech recognition toolkit.

翻译：建立可用的无线电监测自动语音识别系统(ASR)对于资源不足的语言来说是一项艰巨的任务,然而,在无线电是公共交流和讨论的主要媒介的社会中,这是极其重要的;联合国在乌干达的初步努力证明,了解被排除在社会媒体之外的农村人口的看法在国家规划中的重要性;然而,这些努力因没有转录的语音数据集而面临挑战;在本文件中,Makerere 人工智能研究实验室发布了一个Luganda无线电语音成套材料,长达155小时;据我们所知,这是撒哈拉以南非洲第一个公开提供的无线电数据集;该文件介绍了语音资料的开发情况,并介绍了使用开放源的语音识别工具包CoquiTT工具包的基线Luganda ASR工作成绩。

0

相关内容

自动语音识别

自动语音识别

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

β-catenin/Ets1复合体在胶质母细胞瘤中对hTERT表达调控机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

汉藏双语个性化多语种语音合成中的语言建模的研究

国家自然科学基金

0+阅读 · 2012年12月31日

MFHAS1通过ERK信号转导通路对脓毒症小鼠T淋巴细胞的作用及机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

傣语文本分析与语音合成研究

国家自然科学基金

0+阅读 · 2012年12月31日

维吾尔语文本驱动的聋哑人手语合成技术的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Towards Cross-speaker Reading Style Transfer on Audiobook Dataset

Towards Cross-speaker Reading Style Transfer on Audiobook Dataset

Arxiv

0+阅读 · 2022年8月10日

Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectrum Disorders

Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectrum Disorders

Arxiv

0+阅读 · 2022年8月5日

Knowledge Graph Transfer Network for Few-Shot Recognition

Arxiv

15+阅读 · 2019年11月21日

CommanderSong: A Systematic Approach for Practical Adversarial Voice Recognition

Arxiv

14+阅读 · 2018年1月24日

Towards Understanding and Answering Multi-Sentence Recommendation Questions on Tourism

Arxiv

15+阅读 · 2018年1月5日

VIP会员

文章信息

相关主题

自动语音识别

知识 (knowledge)

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

美海军作战管理系统：变革战场空间的二十年

《任务与武器驱动美海军舰队设计》报告

俄罗斯“沙希德”/“天竺葵”攻击无人机

《利用动态图对网络攻击进行建模与仿真：在云安全评估中的应用》90页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

相关论文

Towards Cross-speaker Reading Style Transfer on Audiobook Dataset

Towards Cross-speaker Reading Style Transfer on Audiobook Dataset

Arxiv

0+阅读 · 2022年8月10日

Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectrum Disorders

Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectrum Disorders

Arxiv

0+阅读 · 2022年8月5日

Knowledge Graph Transfer Network for Few-Shot Recognition

Arxiv

15+阅读 · 2019年11月21日

CommanderSong: A Systematic Approach for Practical Adversarial Voice Recognition

Arxiv

14+阅读 · 2018年1月24日

Towards Understanding and Answering Multi-Sentence Recommendation Questions on Tourism

Arxiv

15+阅读 · 2018年1月5日

相关基金

β-catenin/Ets1复合体在胶质母细胞瘤中对hTERT表达调控机制的研究

国家自然科学基金

0+阅读 · 2013年12月31日

汉藏双语个性化多语种语音合成中的语言建模的研究

国家自然科学基金

0+阅读 · 2012年12月31日

MFHAS1通过ERK信号转导通路对脓毒症小鼠T淋巴细胞的作用及机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

傣语文本分析与语音合成研究

国家自然科学基金

0+阅读 · 2012年12月31日

维吾尔语文本驱动的聋哑人手语合成技术的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员