NISST SRE CTS 超集:用于语音扬声器识别的大型数据集 (NIST SRE CTS Superset: A large-scale dataset for telephony speaker recognition)

This document provides a brief description of the National Institute of Standards and Technology (NIST) speaker recognition evaluation (SRE) conversational telephone speech (CTS) Superset. The CTS Superset has been created in an attempt to provide the research community with a large-scale dataset along with uniform metadata that can be used to effectively train and develop telephony (narrowband) speaker recognition systems. It contains a large number of telephony speech segments from more than 6800 speakers with speech durations distributed uniformly in the [10s, 60s] range. The segments have been extracted from the source corpora used to compile prior SRE datasets (SRE1996-2012), including the Greybeard corpus as well as the Switchboard and Mixer series collected by the Linguistic Data Consortium (LDC). In addition to the brief description, we also report speaker recognition results on the NIST 2020 CTS Speaker Recognition Challenge, obtained using a system trained with the CTS Superset. The results will serve as a reference baseline for the challenge.

翻译：本文件简要介绍了国家标准和技术研究所(NIST)语音语音识别评价(SRE)语音语音超集,创建CTS超集是为了向研究界提供大规模数据集以及可用于有效培训和开发电话(窄带)语音识别系统的统一元数据,其中载有来自6800多个发言者的大量电话语音部分,其语音持续时间在[10、60s]范围内统一分布。这些部分是从用于汇编先前的SRE数据集(SRE1996-2012年)的来源公司中提取的,包括灰熊体以及由语言数据联合会收集的切换板和混合系列。除了简要说明外,我们还报告了通过CTS Superse集培训的系统获得的NIST 2020 CTS语音识别挑战的语音识别结果。这些结果将作为挑战的参考基准。

相关内容

声纹识别

关注 0

说话人识别（Speaker Recognition），或者称为声纹识别（Voiceprint Recognition, VPR），是根据语音中所包含的说话人个性信息，利用计算机以及现在的信息识别技术，自动鉴别说话人身份的一种生物特征识别技术。说话人识别研究的目的就是从语音中提取具有说话人表征性的特征，建立有效的模型和系统，实现自动精准的说话人鉴别。

从多个自我监督任务中学习问题无关的语音表示，Learning Problem-agnostic Speech Representations from Multiple Self-supervised Tasks

专知会员服务

17+阅读 · 2020年5月6日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【Yoshua Bengio新论文】多任务自监督学习语音识别，MULTI-TASK SELF-SUPERVISED LEARNING FOR ROBUST SPEECH RECOGNITION

专知会员服务

39+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日