This paper describes the XMUSPEECH speaker recognition and diarisation systems for the VoxCeleb Speaker Recognition Challenge 2021. For track 2, we evaluate two systems, ResNet34-SE and ECAPA-TDNN. For track 4, a key component of our system is the voice activity detection (VAD) module, which greatly improves performance. Our best track 4 submission obtains a diarisation error rate (DER) of 5.54% and a Jaccard error rate (JER) of 27.11% on the evaluation set, compared with a DER of 2.92% and a JER of 20.84% on the development set.