Word 错误率是否是印度语语音识别的好评价指标? (Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?) - 专知论文

会员服务 ·

0

错误率 · 语音识别 · MoDELS · 语言模型化 · 自动语音识别 ·

2022 年 6 月 13 日

Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?

翻译：Word 错误率是否是印度语语音识别的好评价指标?

Priyanshi Shah,Harveen Singh Chadha,Anirudh Gupta,Ankur Dhuriya,Neeraj Chhimwal,Rishabh Gaur,Vivek Raghavan

from arxiv, This paper was submitted to Interspeech 2022

We propose a new method for the calculation of error rates in Automatic Speech Recognition (ASR). This new metric is for languages that contain half characters and where the same character can be written in different forms. We implement our methodology in Hindi which is one of the main languages from Indic context and we think this approach is scalable to other similar languages containing a large character set. We call our metrics Alternate Word Error Rate (AWER) and Alternate Character Error Rate (ACER). We train our ASR models using wav2vec 2.0\cite{baevski2020wav2vec} for Indic languages. Additionally we use language models to improve our model performance. Our results show a significant improvement in analyzing the error rates at word and character level and the interpretability of the ASR system is improved upto $3$\% in AWER and $7$\% in ACER for Hindi. Our experiments suggest that in languages which have complex pronunciation, there are multiple ways of writing words without changing their meaning. In such cases AWER and ACER will be more useful rather than WER and CER as metrics. Further, we open source a new benchmarking dataset of 21 hours for Hindi with the new metric scripts.

翻译：我们提出了一种计算自动语音识别误差率的新方法。这个新的衡量标准针对的是含有半个字符且可以以不同形式写出相同字符的语言。我们用印地语实施我们的方法,印地语是印地语中的主要语言之一,我们认为这个方法可以推广到包含大字符组的其他类似语言。我们称为“替代单词错误率”和替代字符错误率(ACER)的衡量方法。我们用 wav2vec 2.0\cite{baevski20wev2vec}为印地语语言培训了我们的ASR模型。此外,我们使用语言模型改进了我们的模型性能。我们的结果显示,在分析文字和字符级的误差率方面有了重大改进,而且ASR系统的可解释性能在AWER上提高到3美元,印地语的ACER上提高到7美元。我们的实验表明,在具有复杂读音率的语文中,有多种写词的方式,但不会改变其含义。在这种情况下,AWER和ACER将比WER和CER更有用,而不是作为衡量标准。此外,我们打开了21个新版本的版本的版本版本版本,用于新版本。

0

相关内容

错误率

指分类错误的样本数占样本总数的比例。

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

SnO2/KNN纳米纤维的力电耦合效应与主动式氢敏机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

多功能稀土纳米探针用于VX2瘤靶向性CT造影剂和放疗增敏剂的影像学研究

国家自然科学基金

0+阅读 · 2014年12月31日

光磁双功能石墨烯基尖晶石纳米晶体的形成机理及性能调控

国家自然科学基金

0+阅读 · 2014年12月31日

基于RUS方法的稳定各向同性负泊松比合金材料的研制及特性分析

国家自然科学基金

0+阅读 · 2013年12月31日

海洋天然产物Lamellarin D糖基化衍生物的合成与构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

功能化石墨烯量子点合成与荧光传感

国家自然科学基金

0+阅读 · 2012年12月31日

基于靶向TCP-1多肽的多功能超顺磁氧化铁纳米粒子用于结肠癌早期诊断和治疗的研究

国家自然科学基金

0+阅读 · 2012年12月31日

蜂蜜四环素族成分的激光漫反射图像成像机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

超燃冲压发动机进排气系统流场的PIV实验研究

国家自然科学基金

0+阅读 · 2008年12月31日

气－固反应制备IyCo4Sb12/SnO2 纳米复合材料及其热电性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion

Arxiv

0+阅读 · 2022年8月2日

Self-supervised Group Meiosis Contrastive Learning for EEG-Based Emotion Recognition

Arxiv

0+阅读 · 2022年8月2日

SMART: Sentences as Basic Units for Text Evaluation

Arxiv

0+阅读 · 2022年8月1日

Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

Arxiv

0+阅读 · 2022年7月29日

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

Arxiv

0+阅读 · 2022年7月29日

Language Models Can Teach Themselves to Program Better

Arxiv

0+阅读 · 2022年7月29日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

Reasoning on Knowledge Graphs with Debate Dynamics

Reasoning on Knowledge Graphs with Debate Dynamics

Arxiv

14+阅读 · 2020年1月2日

Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks

Arxiv

14+阅读 · 2019年8月8日

VIP会员

文章信息

相关主题

语言模型化

自动语音识别

相关VIP内容

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

《DeepGCNs: Making GCNs Go as Deep as CNNs》

《DeepGCNs: Making GCNs Go as Deep as CNNs》

专知会员服务

31+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

2019年机器学习框架回顾

2019年机器学习框架回顾

专知会员服务

36+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【牛津大学博士论文】将序列结构与几何结构融入深度神经网络

工程视角：影响战争进程的小型无人机

企业级AI应用开发：从技术选型到生产落地

AI生成代码缺陷综述

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE ICKG 2022: Call for Papers

IEEE ICKG 2022: Call for Papers

机器学习与推荐算法

3+阅读 · 2022年3月30日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

相关论文

An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion

Arxiv

0+阅读 · 2022年8月2日

Self-supervised Group Meiosis Contrastive Learning for EEG-Based Emotion Recognition

Arxiv

0+阅读 · 2022年8月2日

SMART: Sentences as Basic Units for Text Evaluation

Arxiv

0+阅读 · 2022年8月1日

Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

Arxiv

0+阅读 · 2022年7月29日

Learning Phone Recognition from Unpaired Audio and Phone Sequences Based on Generative Adversarial Network

Arxiv

0+阅读 · 2022年7月29日

Language Models Can Teach Themselves to Program Better

Arxiv

0+阅读 · 2022年7月29日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

Unsupervised Domain Clusters in Pretrained Language Models

Arxiv

11+阅读 · 2020年4月5日

Reasoning on Knowledge Graphs with Debate Dynamics

Reasoning on Knowledge Graphs with Debate Dynamics

Arxiv

14+阅读 · 2020年1月2日

Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks

Arxiv

14+阅读 · 2019年8月8日

相关基金

SnO2/KNN纳米纤维的力电耦合效应与主动式氢敏机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

多功能稀土纳米探针用于VX2瘤靶向性CT造影剂和放疗增敏剂的影像学研究

国家自然科学基金

0+阅读 · 2014年12月31日

光磁双功能石墨烯基尖晶石纳米晶体的形成机理及性能调控

国家自然科学基金

0+阅读 · 2014年12月31日

基于RUS方法的稳定各向同性负泊松比合金材料的研制及特性分析

国家自然科学基金

0+阅读 · 2013年12月31日

海洋天然产物Lamellarin D糖基化衍生物的合成与构效关系研究

国家自然科学基金

0+阅读 · 2013年12月31日

功能化石墨烯量子点合成与荧光传感

国家自然科学基金

0+阅读 · 2012年12月31日

基于靶向TCP-1多肽的多功能超顺磁氧化铁纳米粒子用于结肠癌早期诊断和治疗的研究

国家自然科学基金

0+阅读 · 2012年12月31日

蜂蜜四环素族成分的激光漫反射图像成像机理研究

国家自然科学基金

0+阅读 · 2009年12月31日

超燃冲压发动机进排气系统流场的PIV实验研究

国家自然科学基金

0+阅读 · 2008年12月31日

气－固反应制备IyCo4Sb12/SnO2 纳米复合材料及其热电性能研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员