瑞士德语对文本系统评价的德文演讲 (Swiss German Speech to Text system evaluation) - 专知论文

会员服务 ·

0

BLEU · Analysis · 得分 · MoDELS · 情景 ·

2022 年 7 月 1 日

Swiss German Speech to Text system evaluation

翻译：瑞士德语对文本系统评价的德文演讲

Yanick Schraner,Christian Scheller,Michel Plüss,Manfred Vogel

from arxiv, arXiv admin note: text overlap with arXiv:2205.09501

We present an in-depth evaluation of four commercially available Speech-to-Text (STT) systems for Swiss German. The systems are anonymized and referred to as system a-d in this report. We compare the four systems to our STT model, referred to as FHNW from hereon after, and provide details on how we trained our model. To evaluate the models, we use two STT datasets from different domains. The Swiss Parliament Corpus (SPC) test set and a private dataset in the news domain with an even distribution across seven dialect regions. We provide a detailed error analysis to detect the three systems' strengths and weaknesses. This analysis is limited by the characteristics of the two test sets. Our model scored the highest bilingual evaluation understudy (BLEU) on both datasets. On the SPC test set, we obtain a BLEU score of 0.607, whereas the best commercial system reaches a BLEU score of 0.509. On our private test set, we obtain a BLEU score of 0.722 and the best commercial system a BLEU score of 0.568.

翻译：我们为瑞士德国人提供了四个商业上可用的语音到文字系统(STT)的深入评价。这些系统匿名,在本报告中被称为A-d系统。我们比较了四个系统与我们的STT模型,从后面称为FHNW, 并详细介绍了我们如何培训我们的模型。为了评价模型,我们使用了两个不同领域的STT数据集。瑞士议会Corpus(SPC)测试集和新闻域的私人数据集,平均分布于七个方言区域。我们提供了详细的错误分析,以发现三个系统的优缺点。这一分析受两个测试组特点的限制。我们的模型在这两个数据集中都获得了最高的双语评估。在SPC测试组中,我们获得了0.607的BLEU分,而最佳商业系统达到0.509的BLEU分。在我们私人测试组中,我们获得了0.722的BLEU分,而最佳商业系统获得0.568的BLEU分。

0

相关内容

BLEU

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

机器学习相关资源(框架、库、软件)大列表

机器学习相关资源(框架、库、软件)大列表

专知会员服务

40+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ARVCF调节cadherin/catenin复合体介导的细胞间黏附的分子机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

Copine VII在阿尔茨海默病中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

一类稳态Schödinger-Poisson-Slater方程标准化解的研究

国家自然科学基金

1+阅读 · 2015年12月31日

冲击波致微结构演化对钛合金力学性能的影响

国家自然科学基金

0+阅读 · 2015年12月31日

LAG-3负向调控HCV特异的CD8+T细胞免疫反应及其机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

莪术醇干预肝纤维化HSEC及其信号调控作用的研究

国家自然科学基金

0+阅读 · 2014年12月31日

MAPK调控采后香蕉果实成熟的分子机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

具有抗肿瘤活性愈创木烷型倍半萜类天然产物全合成

国家自然科学基金

0+阅读 · 2012年12月31日

贵金属修饰的石墨烯/陶瓷复合纳米纤维光电转换的协同机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Treg细胞对Th1、Th2、Th17细胞介导的眼内炎症的调节作用

国家自然科学基金

0+阅读 · 2009年12月31日

SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation

SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation

Arxiv

0+阅读 · 2022年8月23日

Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

Arxiv

0+阅读 · 2022年8月22日

The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation

Arxiv

0+阅读 · 2022年8月22日

Selection Collider Bias in Large Language Models

Arxiv

0+阅读 · 2022年8月22日

Evaluating Out-of-Distribution Detectors Through Adversarial Generation of Outliers

Arxiv

0+阅读 · 2022年8月20日

Trigger-free Event Detection via Derangement Reading Comprehension

Arxiv

0+阅读 · 2022年8月20日

Cross-Domain Evaluation of a Deep Learning-Based Type Inference System

Arxiv

0+阅读 · 2022年8月19日

ARID: A New Dataset for Recognizing Action in the Dark

Arxiv

0+阅读 · 2022年8月19日

Using Large Language Models to Simulate Multiple Humans

Arxiv

0+阅读 · 2022年8月18日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

机器学习相关资源(框架、库、软件)大列表

机器学习相关资源(框架、库、软件)大列表

专知会员服务

40+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk1

【ICIG2021】Latest News & Announcements of the Plenary Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年11月1日

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

会议交流 | IJCKG: International Joint Conference on Knowledge Graphs

开放知识图谱

0+阅读 · 2021年9月9日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation

SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation

Arxiv

0+阅读 · 2022年8月23日

Dialogue Term Extraction using Transfer Learning and Topological Data Analysis

Arxiv

0+阅读 · 2022年8月22日

The GENEA Challenge 2022: A large evaluation of data-driven co-speech gesture generation

Arxiv

0+阅读 · 2022年8月22日

Selection Collider Bias in Large Language Models

Arxiv

0+阅读 · 2022年8月22日

Evaluating Out-of-Distribution Detectors Through Adversarial Generation of Outliers

Arxiv

0+阅读 · 2022年8月20日

Trigger-free Event Detection via Derangement Reading Comprehension

Arxiv

0+阅读 · 2022年8月20日

Cross-Domain Evaluation of a Deep Learning-Based Type Inference System

Arxiv

0+阅读 · 2022年8月19日

ARID: A New Dataset for Recognizing Action in the Dark

Arxiv

0+阅读 · 2022年8月19日

Using Large Language Models to Simulate Multiple Humans

Arxiv

0+阅读 · 2022年8月18日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

相关基金

ARVCF调节cadherin/catenin复合体介导的细胞间黏附的分子机制研究

国家自然科学基金

0+阅读 · 2016年12月31日

Copine VII在阿尔茨海默病中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

一类稳态Schödinger-Poisson-Slater方程标准化解的研究

国家自然科学基金

1+阅读 · 2015年12月31日

冲击波致微结构演化对钛合金力学性能的影响

国家自然科学基金

0+阅读 · 2015年12月31日

LAG-3负向调控HCV特异的CD8+T细胞免疫反应及其机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

莪术醇干预肝纤维化HSEC及其信号调控作用的研究

国家自然科学基金

0+阅读 · 2014年12月31日

MAPK调控采后香蕉果实成熟的分子机理研究

国家自然科学基金

0+阅读 · 2013年12月31日

具有抗肿瘤活性愈创木烷型倍半萜类天然产物全合成

国家自然科学基金

0+阅读 · 2012年12月31日

贵金属修饰的石墨烯/陶瓷复合纳米纤维光电转换的协同机理研究

国家自然科学基金

0+阅读 · 2011年12月31日

Treg细胞对Th1、Th2、Th17细胞介导的眼内炎症的调节作用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员