HuBERT-TR: Reviving Turkish Automatic Speech Recognition with Self-supervised Speech Representation Learning


October 27, 2022


Ali Safaya, Engin Erzin

From arXiv; submitted to ICASSP 2023

Although Turkish is listed among low-resource languages, the literature on Turkish automatic speech recognition (ASR) is relatively dated. In this paper, we present HuBERT-TR, a speech representation model for Turkish based on HuBERT. HuBERT-TR achieves state-of-the-art results on several Turkish ASR datasets. We investigate pre-training HuBERT for Turkish with large-scale data curated from online resources: HuBERT-TR is pre-trained on over 6,500 hours of speech curated from YouTube, covering extensive variability in quality and genre. We show that language-specific models are superior to other pre-trained models: our Turkish model HuBERT-TR/base performs better than the 10 times larger state-of-the-art multilingual XLS-R-1b model in low-resource settings. Moreover, we study the effect of scaling on ASR performance by scaling our models up to 1B parameters. Our best model yields a state-of-the-art word error rate of 4.97% on the Turkish Broadcast News dataset. Models are available at https://huggingface.co/asafaya
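The 4.97% figure above is a word error rate (WER). As a rough illustration of how that metric is usually computed, the sketch below assumes the standard definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a hypothesis, divided by the number of reference words. The function name `word_error_rate`, the whitespace tokenization, and the Turkish example sentence are assumptions made for illustration; the abstract does not describe the authors' text normalization or scoring tooling.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; splits on whitespace only and
    applies no casing or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One of four reference words is dropped, so the WER is 1/4.
print(word_error_rate("bugün hava çok güzel", "bugün hava güzel"))  # 0.25
```

Under this definition, a WER of 4.97% corresponds to roughly five word-level errors per hundred reference words.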


