We present JHU's system submission to the ASVspoof 2019 Challenge: Anti-Spoofing with Squeeze-Excitation and Residual neTworks (ASSERT). Anti-spoofing has gathered more and more attention since the inauguration of the ASVspoof Challenges, and ASVspoof 2019 dedicates to address attacks from all three major types: text-to-speech, voice conversion, and replay. Built upon previous research work on Deep Neural Network (DNN), ASSERT is a pipeline for DNN-based approach to anti-spoofing. ASSERT has four components: feature engineering, DNN models, network optimization and system combination, where the DNN models are variants of squeeze-excitation and residual networks. We conducted an ablation study of the effectiveness of each component on the ASVspoof 2019 corpus, and experimental results showed that ASSERT obtained more than 93% and 17% relative improvements over the baseline systems in the two sub-challenges in ASVspooof 2019, ranking ASSERT one of the top performing systems. Code and pretrained models will be made publicly available.
翻译:我们向ASVSpoof 2019挑战:用挤压式排泄和残留的神经工作(ASSERT)提出JHU的系统提交2019年ASVSpoof 挑战(ASSERT) : 使用挤压式排泄和残留的神经工作(ASSERT) : 自ASVSpoof 挑战(ASSERT)启动以来,反排吐式排泄系统已引起越来越多的注意,而ASVSpoof 2019年ASVSpooe、语音转换和重播等三大类型都致力于应对袭击。根据以前关于深神经网络(DNN)的研究工作,ASSERT是DN的反排泄方法的管道。ASSERT有四个组成部分:特征工程、DNNN模型、网络优化和系统组合,DNNS模型是挤压式排挤式排出式和残余网络。我们对ASSS 2019年ASTER 2019年两个次试查系统的基准系统每个组成部分的效能超过93%和17%的相对改进。