MultiSV:远距离多通道发言人核查数据集 (MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification)

Motivated by unconsolidated data situation and the lack of a standard benchmark in the field, we complement our previous efforts and present a comprehensive corpus designed for training and evaluating text-independent multi-channel speaker verification systems. It can be readily used also for experiments with dereverberation, denoising, and speech enhancement. We tackled the ever-present problem of the lack of multi-channel training data by utilizing data simulation on top of clean parts of the Voxceleb dataset. The development and evaluation trials are based on a retransmitted Voices Obscured in Complex Environmental Settings (VOiCES) corpus, which we modified to provide multi-channel trials. We publish full recipes that create the dataset from public sources as the MultiSV corpus, and we provide results with two of our multi-channel speaker verification systems with neural network-based beamforming based either on predicting ideal binary masks or the more recent Conv-TasNet.

翻译：在未加综合的数据状况和缺乏实地标准基准的推动下,我们补充了我们以前的努力,并提出了旨在培训和评价文本独立的多频道扬声器核查系统的综合材料,还可以随时用于变形、调离和语音增强等实验。我们通过在Voxceleb数据集清洁部分之上利用数据模拟来解决缺乏多频道培训数据这一始终存在的问题。开发和评价试验的基础是在复杂的环境环境环境环境中重新传播的声音(VoiCES)系统,我们对该系统进行了修改,以提供多频道试验。我们公布了从公共来源创建数据集的全方位配方,作为多系统,我们还以预测理想的双环面具或最近的Conv-TasNet为基础,以神经网络为基础,以两个多频道扬声器核查系统为基础,提供了结果。

相关内容

数据集

关注 88

数据集，又称为资料集、数据集合或资料集合，是一种由数据所组成的集合。
Data set（或dataset）是一个数据的集合，通常以表格形式出现。每一列代表一个特定变量。每一行都对应于某一成员的数据集的问题。它列出的价值观为每一个变量，如身高和体重的一个物体或价值的随机数。每个数值被称为数据资料。对应于行数，该数据集的数据可能包括一个或多个成员。

自然语言生成综述

专知会员服务

65+阅读 · 2021年5月29日

【ACL2020-亚马逊】Transformers多分辨率和多模态语音识别，Multiresolution and Multimodal Speech Recognition with Transformers

专知会员服务

15+阅读 · 2020年5月5日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日