The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of bi-annual challenges: ASVspoof 2019. With the challenge database and protocols being described elsewhere, the focus of this paper is on results and the top performing single and ensemble system submissions from 62 teams, all of which out-perform the two baseline systems, often by a substantial margin. Deeper analyses shows that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, while oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge.
翻译:ASVspooof倡议的构想是带头研究反吹嘘自动扬声器核查(ASV)的反吹嘘倡议。本文件描述了一系列双年度挑战中的第三个挑战:2019年ASVspoof 2019年ASVspoof 。随着挑战数据库和协议的描述,本文件的重点是62个团队提交的结果和最高性能单一和集合系统,这些团队的单一和集合系统都比两个基线系统表现得更好,而且往往有很大的幅度。更深层的分析表明,性能受具体条件的影响,涉及具体的重复攻击或特定的音响环境。混合对涉及语音合成和声音转换攻击的逻辑访问情景特别有效,但参与者基本上为成功应用涉及模拟重播攻击的实际访问情景而挣扎,这很可能是系统缺乏互补性的结果,而骨电聚实验显然有可能改善性能。此外,模拟数据的结果显示,真实重现数据的实验显示存在巨大的差距,最可能是由于后一种添加噪音。这一结论除其他外,导致对未来版本的ASV进行进一步的研究和方向提出若干挑战。