H_SD:自动语音识别任务新的混合评价指标 (Hybrid-SD (H_SD): A new hybrid evaluation metric for automatic speech recognition tasks) - 专知论文

会员服务 ·

0

语音识别 · 自动语音识别 · 错误率 · MoDELS · 相关系数 ·

2022 年 11 月 9 日

Hybrid-SD (H_SD): A new hybrid evaluation metric for automatic speech recognition tasks

翻译：H_SD:自动语音识别任务新的混合评价指标

Zitha Sasindran,Harsha Yelchuri,Supreeth Rao,T. V. Prabhakar

Many studies have examined the shortcomings of word error rate (WER) as an evaluation metric for automatic speech recognition (ASR) systems, particularly when used for spoken language understanding tasks such as intent recognition and dialogue systems. In this paper, we propose Hybrid-SD (H_SD), a new hybrid evaluation metric for ASR systems that takes into account both semantic correctness and error rate. To generate sentence dissimilarity scores (SD), we built a fast and lightweight SNanoBERT model using distillation techniques. Our experiments show that the SNanoBERT model is 25.9x smaller and 38.8x faster than SRoBERTa while achieving comparable results on well-known benchmarks. Hence, making it suitable for deploying with ASR models on edge devices. We also show that H_SD correlates more strongly with downstream tasks such as intent recognition and named-entity recognition (NER).

翻译：许多研究审查了单词错误率(WER)作为自动语音识别(ASR)系统评价指标的缺点,特别是在用于诸如意向识别和对话系统等口语理解任务时;在本文件中,我们建议采用混合-SD(H_SD),这是一种考虑到语义正确性和误差率的新混合评价指标;为了产生判决差异分数(SD),我们利用蒸馏技术建立了一个快速和轻量的SnanoBERT模型;我们的实验表明,SnanoBERTA模型比SROBERTA模型要小25.9x和38.8x,同时在众所周知的基准上取得可比较的结果。因此,我们建议采用混合-SD(H_SD),它适合于在边缘装置上部署ASR模型。我们还表明,H_SD(SD)与下游任务(例如意向识别和点名实体识别(NER)的关系更为密切。

0

相关内容

语音识别

语音识别是计算机科学和计算语言学的一个跨学科子领域，它发展了一些方法和技术，使计算机可以将口语识别和翻译成文本。它也被称为自动语音识别（ASR），计算机语音识别或语音转文本（STT）。它整合了计算机科学，语言学和计算机工程领域的知识和研究。

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

非对称人字形波纹板式换热器自激振荡流动与强化换热机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

Cer—SPK—S1P通路在动脉粥样硬化中的作用及田黄片的干预研究

国家自然科学基金

0+阅读 · 2015年12月31日

复杂异质智能网络的同步行为分析与可控性研究

国家自然科学基金

1+阅读 · 2014年12月31日

三维集成扰流式散热微流道与TSV力-电耦合作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition

Arxiv

0+阅读 · 2022年12月30日

Alignment-guided Temporal Attention for Video Action Recognition

Arxiv

0+阅读 · 2022年12月30日

Learning Representations for Masked Facial Recovery

Arxiv

0+阅读 · 2022年12月28日

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

Arxiv

10+阅读 · 2021年1月24日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

VIP会员

文章信息

相关主题

自动语音识别

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

美海军作战管理系统：变革战场空间的二十年

《任务与武器驱动美海军舰队设计》报告

俄罗斯“沙希德”/“天竺葵”攻击无人机

《利用动态图对网络攻击进行建模与仿真：在云安全评估中的应用》90页

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition

Arxiv

0+阅读 · 2022年12月30日

Alignment-guided Temporal Attention for Video Action Recognition

Arxiv

0+阅读 · 2022年12月30日

Learning Representations for Masked Facial Recovery

Arxiv

0+阅读 · 2022年12月28日

Towards Robust Visual Information Extraction in Real World: New Dataset and Novel Solution

Arxiv

10+阅读 · 2021年1月24日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

相关基金

非对称人字形波纹板式换热器自激振荡流动与强化换热机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

Cer—SPK—S1P通路在动脉粥样硬化中的作用及田黄片的干预研究

国家自然科学基金

0+阅读 · 2015年12月31日

复杂异质智能网络的同步行为分析与可控性研究

国家自然科学基金

1+阅读 · 2014年12月31日

三维集成扰流式散热微流道与TSV力-电耦合作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

实时安全关键系统的建模、仿真与验证

国家自然科学基金

1+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员