Conventional speech spoofing countermeasures (CMs) are designed to make a binary decision on an input trial. However, a CM trained on a closed-set database is theoretically not guaranteed to perform well on unknown spoofing attacks. In some scenarios, an alternative strategy is to let the CM defer a decision when it is not confident. The question is then how to estimate a CM's confidence regarding an input trial. We investigated a few confidence estimators that can be easily plugged into a CM. On the ASVspoof2019 logical access database, the results demonstrate that an energy-based estimator and a neural-network-based one achieved acceptable performance in identifying unknown attacks in the test set. On a test set with additional unknown attacks and bona fide trials from other databases, the confidence estimators performed moderately well, and the CMs better discriminated bona fide and spoofed trials that had a high confidence score. Additional results also revealed the difficulty in enhancing a confidence estimator by adding unknown attacks to the training set.
翻译:常规言论防范措施(CMs)旨在就输入试验作出二进制决定,然而,在封闭式数据库上受过训练的CM在理论上并不能保证在未知的潜伏攻击中表现良好。在某些情况下,另一种战略是让CM在缺乏信心时推迟作出决定。然后的问题是如何估计CM对输入试验的信任度。我们调查了一些可以很容易地插入CM的自信估计器。在ASVspoof 2019逻辑访问数据库中,结果显示,一个基于能量的天体仪和一个基于神经网络的天体网络在确定测试集中的未知攻击时取得了可接受的性能。在一个测试集中,在额外进行未知攻击和从其他数据库进行善意试验时,信任估计器进行了中度井,而CMs对信任得分较高的善意和假体试验进行了更好的歧视。其他结果还显示,难以通过在训练集中增加未知的攻击来增强信任估计器。