FLEURS:对普遍言论代表制的学习评价少见 (FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech)

We introduce FLEURS, the Few-shot Learning Evaluation of Universal Representations of Speech benchmark. FLEURS is an n-way parallel speech dataset in 102 languages built on top of the machine translation FLoRes-101 benchmark, with approximately 12 hours of speech supervision per language. FLEURS can be used for a variety of speech tasks, including Automatic Speech Recognition (ASR), Speech Language Identification (Speech LangID), Translation and Retrieval. In this paper, we provide baselines for the tasks based on multilingual pre-trained models like mSLAM. The goal of FLEURS is to enable speech technology in more languages and catalyze research in low-resource speech understanding.

翻译：我们引入了通用语言代表制的微小学习评估FLEURS基准。FLEURS是一个以102种语言建立的双向平行语言语音数据集,以机器翻译FLORes-101基准为基础,每个语言都有大约12小时的语音监督。FLEURS可用于各种语言的演讲任务,包括自动语音识别(ASR)、语音语言识别(Speech LangID)、翻译和检索。在本文中,我们为基于多种语言的预先培训模式(如MSLAM)的任务提供了基准。FLEURS的目标是使语言技术能够以更多语言进行,并促进对低资源语言理解的研究。

相关内容

小样本学习

关注 216

小样本学习（Few-Shot Learning，以下简称 FSL ）用于解决当可用的数据量比较少时，如何提升神经网络的性能。在 FSL 中，经常用到的一类方法被称为 Meta-learning。和普通的神经网络的训练方法一样，Meta-learning 也包含训练过程和测试过程，但是它的训练过程被称作 Meta-training 和 Meta-testing。

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

元迁移学习的小样本学习，Meta-transfer Learning for Few-shot Learning

专知会员服务

159+阅读 · 2020年2月29日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日