泰文Wav2Vec2.0和通用V8 (Thai Wav2Vec2.0 with CommonVoice V8) - 专知论文

会员服务 ·

0

语音识别 · MoDELS · Performer · 三元语法 · 语言模型化 ·

2022 年 8 月 9 日

Thai Wav2Vec2.0 with CommonVoice V8

翻译：泰文Wav2Vec2.0和通用V8

Wannaphong Phatthiyaphaibun,Chompakorn Chaksangchaichot,Peerat Limkonchotiwat,Ekapol Chuangsuwanich,Sarana Nutanong

Recently, Automatic Speech Recognition (ASR), a system that converts audio into text, has caught a lot of attention in the machine learning community. Thus, a lot of publicly available models were released in HuggingFace. However, most of these ASR models are available in English; only a minority of the models are available in Thai. Additionally, most of the Thai ASR models are closed-sourced, and the performance of existing open-sourced models lacks robustness. To address this problem, we train a new ASR model on a pre-trained XLSR-Wav2Vec model with the Thai CommonVoice corpus V8 and train a trigram language model to boost the performance of our ASR model. We hope that our models will be beneficial to individuals and the ASR community in Thailand.

翻译：最近,将音频转换成文字的系统自动语音识别系统(ASR)在机器学习界引起了极大关注,因此,在Hugging Face发布了许多公开的模型,但大多数ASR模型都以英语提供;只有极少数模型以泰文提供;此外,大多数泰国ASR模型都是封闭来源的,现有开放来源模型的性能缺乏活力;为解决这一问题,我们用泰国通用Viicecamps V8培训了一个新的ASR模型,与泰国通用Vicecampulation V8培训了三种语言模型,以提高我们ASR模型的性能。我们希望我们的模型将有益于泰国的个人和ASR社区。

0

相关内容

语音识别

语音识别是计算机科学和计算语言学的一个跨学科子领域，它发展了一些方法和技术，使计算机可以将口语识别和翻译成文本。它也被称为自动语音识别（ASR），计算机语音识别或语音转文本（STT）。它整合了计算机科学，语言学和计算机工程领域的知识和研究。

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

开放量子系统非马尔科夫动力学过程量子仿真研究

国家自然科学基金

0+阅读 · 2014年12月31日

PRC1和PRC2共同调控Snail介导的上皮-间质转换在胰腺肿瘤转移机制中的研究

国家自然科学基金

0+阅读 · 2014年12月31日

外加应力及含水蒸气环境中CoNiCrAlY涂层表面氧化层的生长机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Pictet–Spengler类反应机理的理论研究和新反应设计

国家自然科学基金

0+阅读 · 2013年12月31日

热致相分离法制备聚丙烯腈中空纤维膜及膜孔结构与成形机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming Voice Trigger

Arxiv

0+阅读 · 2022年10月6日

A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding

Arxiv

0+阅读 · 2022年10月3日

Neuro-Symbolic Procedural Planning with Commonsense Prompting

Arxiv

0+阅读 · 2022年10月3日

Neuro-Symbolic Causal Language Planning with Commonsense Prompting

Arxiv

0+阅读 · 2022年9月29日

Differentiable Reasoning on Large Knowledge Bases and Natural Language

Arxiv

12+阅读 · 2019年12月17日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

大模型推理时代的知识编辑

《利用人工智能对军事行动进行建模》

【MIT博士论文】加速科学发现的因果建模实践算法

机器人、无人机与实时影像：应对城市爆炸威胁的三大技术方案

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

WakeUpNet: A Mobile-Transformer based Framework for End-to-End Streaming Voice Trigger

Arxiv

0+阅读 · 2022年10月6日

A Fine-tuned Wav2vec 2.0/HuBERT Benchmark For Speech Emotion Recognition, Speaker Verification and Spoken Language Understanding

Arxiv

0+阅读 · 2022年10月3日

Neuro-Symbolic Procedural Planning with Commonsense Prompting

Arxiv

0+阅读 · 2022年10月3日

Neuro-Symbolic Causal Language Planning with Commonsense Prompting

Arxiv

0+阅读 · 2022年9月29日

Differentiable Reasoning on Large Knowledge Bases and Natural Language

Arxiv

12+阅读 · 2019年12月17日

相关基金

开放量子系统非马尔科夫动力学过程量子仿真研究

国家自然科学基金

0+阅读 · 2014年12月31日

PRC1和PRC2共同调控Snail介导的上皮-间质转换在胰腺肿瘤转移机制中的研究

国家自然科学基金

0+阅读 · 2014年12月31日

外加应力及含水蒸气环境中CoNiCrAlY涂层表面氧化层的生长机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

Pictet–Spengler类反应机理的理论研究和新反应设计

国家自然科学基金

0+阅读 · 2013年12月31日

热致相分离法制备聚丙烯腈中空纤维膜及膜孔结构与成形机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员