用于代码转换自动语音识别的衡量尺度 (Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition) - 专知论文

会员服务 ·

0

语音识别 · 相关系数 · 自动语音识别 · 有向 · Facebook AI Research ·

2022 年 11 月 22 日

Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition

翻译：用于代码转换自动语音识别的衡量尺度

Injy Hamed,Amir Hussein,Oumnia Chellah,Shammur Chowdhury,Hamdy Mubarak,Sunayana Sitaram,Nizar Habash,Ahmed Ali

from arxiv, Accepted to SLT 2022

Code-switching poses a number of challenges and opportunities for multilingual automatic speech recognition. In this paper, we focus on the question of robust and fair evaluation metrics. To that end, we develop a reference benchmark data set of code-switching speech recognition hypotheses with human judgments. We define clear guidelines for minimal editing of automatic hypotheses. We validate the guidelines using 4-way inter-annotator agreement. We evaluate a large number of metrics in terms of correlation with human judgments. The metrics we consider vary in terms of representation (orthographic, phonological, semantic), directness (intrinsic vs extrinsic), granularity (e.g. word, character), and similarity computation method. The highest correlation to human judgment is achieved using transliteration followed by text normalization. We release the first corpus for human acceptance of code-switching speech recognition results in dialectal Arabic/English conversation speech.

翻译：代码转换为多语种自动语音识别带来了许多挑战和机遇。在本文中,我们侧重于稳健和公正的评价指标问题。为此,我们开发了一套参考基准数据,根据人文判断来设定密码转换语音识别假设的假设。我们为自动假设的最小编辑制定了明确的指导方针。我们使用四向间翻译协议来验证准则。我们从与人类判断的相关性的角度来评估大量衡量标准。我们认为在代表性(体格学、声学、语义)、直接性(异性与外性)、颗粒性(如字词、特性)和相似性计算方法方面,衡量标准各不相同。与人类判断的最高相关性是在文本正常化之后通过翻写来实现的。我们发布了关于人类在方言阿拉伯语/英语谈话中接受密码转换语音识别结果的第一套材料。我们发布了关于人类接受方言语阿拉伯语/英语语音识别结果的第一套材料。

0

相关内容

语音识别

语音识别是计算机科学和计算语言学的一个跨学科子领域，它发展了一些方法和技术，使计算机可以将口语识别和翻译成文本。它也被称为自动语音识别（ASR），计算机语音识别或语音转文本（STT）。它整合了计算机科学，语言学和计算机工程领域的知识和研究。

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

回声干扰抑制中的自适应信号处理算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于Petri网的自动制造系统分布式控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于有限容量Petri网的离散事件系统监控理论

国家自然科学基金

0+阅读 · 2012年12月31日

基于响应灵敏度分析的结构损伤识别新方法及实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于多核机群的Petri网并行算法的研究与实现

国家自然科学基金

0+阅读 · 2011年12月31日

Benchmarking Large Language Models for News Summarization

Arxiv

0+阅读 · 2023年1月31日

New Metrics to Encourage Innovation and Diversity in Information Retrieval Approaches

Arxiv

0+阅读 · 2023年1月30日

Quality Evaluation of Arbitrary Style Transfer: Subjective Study and Objective Metric

Arxiv

0+阅读 · 2023年1月29日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Arxiv

11+阅读 · 2019年11月4日

VIP会员

文章信息

相关主题

自动语音识别

Facebook AI Research

相关VIP内容

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【伯克利博士论文】从推理服务到模型训练：面向大规模 LLM 智能体的高效系统构建

面向作战人员负责任地寻求生成式人工智能

《Hello-Agents》项目正式发布，一起从零学习智能体！

智能体 AI (Agentic AI) 的新进展：回归初心，预见未来

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

相关论文

Benchmarking Large Language Models for News Summarization

Arxiv

0+阅读 · 2023年1月31日

New Metrics to Encourage Innovation and Diversity in Information Retrieval Approaches

Arxiv

0+阅读 · 2023年1月30日

Quality Evaluation of Arbitrary Style Transfer: Subjective Study and Objective Metric

Arxiv

0+阅读 · 2023年1月29日

Recovering 3D Human Mesh from Monocular Images: A Survey

Arxiv

12+阅读 · 2022年3月8日

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Arxiv

11+阅读 · 2019年11月4日

相关基金

回声干扰抑制中的自适应信号处理算法研究

国家自然科学基金

1+阅读 · 2015年12月31日

基于Petri网的自动制造系统分布式控制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于有限容量Petri网的离散事件系统监控理论

国家自然科学基金

0+阅读 · 2012年12月31日

基于响应灵敏度分析的结构损伤识别新方法及实验研究

国家自然科学基金

0+阅读 · 2011年12月31日

基于多核机群的Petri网并行算法的研究与实现

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员