Automatic speaker verification (ASV) is a well developed technology for biometric identification, and has been ubiquitous implemented in security-critic applications, such as banking and access control. However, previous works have shown that ASV is under the radar of adversarial attacks, which are very similar to their original counterparts from human's perception, yet will manipulate the ASV render wrong prediction. Due to the very late emergence of adversarial attacks for ASV, effective countermeasures against them are limited. Given that the security of ASV is of high priority, in this work, we propose the idea of "voting for the right answer" to prevent risky decisions of ASV in blind spot areas, by employing random sampling and voting. Experimental results show that our proposed method improves the robustness against both the limited-knowledge attackers by pulling the adversarial samples out of the blind spots, and the perfect-knowledge attackers by introducing randomness and increasing the attackers' budgets. The code for reproducing main results is available at https://github.com/thuhcsi/adsv_voting.
翻译:自动扬声器核查(ASV)是生物鉴别的先进技术,在银行和出入控制等安全机密应用中普遍应用,但是,以前的工作表明,ASV处于对抗性攻击的雷达之下,这种攻击与最初的对口者非常相似,但根据人类的看法,它会操纵ASV, 作出错误的预测。由于对口攻击的出现很迟,因此,针对他们的有效反措施是有限的。鉴于ASV的安全是高度优先事项,在这项工作中,我们提议采用随机抽样和投票方式,“投票支持正确的答案”的想法,以防止在盲点地区对ASV作出危险的决定。实验结果显示,我们提议的方法是通过将对抗性攻击者从盲点提取对抗性样品,通过引进随机性和增加攻击者的预算来增强对口攻击者的能力。在https://github.com/thucsi/adsv_voting中可以找到重新产生主要结果的代码。