Deep learning has brought impressive progress in the study of both automatic speaker verification (ASV) and spoofing countermeasures (CM). Although solutions are mutually dependent, they have typically evolved as standalone sub-systems whereby CM solutions are usually designed for a fixed ASV system. The work reported in this paper aims to gauge the improvements in reliability that can be gained from their closer integration. Results derived using the popular ASVspoof2019 dataset indicate that the equal error rate (EER) of a state-of-the-art ASV system degrades from 1.63% to 23.83% when the evaluation protocol is extended with spoofed trials.%subjected to spoofing attacks. However, even the straightforward integration of ASV and CM systems in the form of score-sum and deep neural network-based fusion strategies reduce the EER to 1.71% and 6.37%, respectively. The new Spoofing-Aware Speaker Verification (SASV) challenge has been formed to encourage greater attention to the integration of ASV and CM systems as well as to provide a means to benchmark different solutions.
翻译:虽然解决办法是相互依存的,但它们通常演变成独立的子系统,使内存解决方案通常用于固定的ASV系统。本文件报告的工作旨在衡量从更紧密的整合中可以取得的可靠性改进情况。使用流行的ASVspoof2019数据集得出的结果表明,当评价协议以虚假的试验扩展时,先进的ASV系统的平均误差率从1.63%降至23.83%。%受到欺骗式攻击的影响。然而,即使是直接整合的ASV和内存系统,以得分和深神经网络为基础的聚变战略的形式,也分别将EER降低到1.71%和6.37%。新的Spooofing-Aware发言人校验(SASVV)的挑战已经形成,目的是鼓励更多地关注ASV和内存系统的整合,并提供衡量不同解决方案的基准手段。