The deep learning models used for speaker verification are heavily dependent on large-scale data and correct labels. However, noisy (wrong) labels often occur, which deteriorates the system's performance. Unfortunately, there are relatively few studies in this area. In this paper, we propose a method to gradually filter noisy labels out at the training stage. We compare the network predictions at different training epochs with ground-truth labels, and select reliable (considered correct) labels by using the OR gate mechanism like that in logic circuits. Therefore, our proposed method is named as OR-Gate. We experimentally demonstrated that the OR-Gate can effectively filter noisy labels out and has excellent performance.
翻译:用于语音校验的深层学习模式严重依赖大型数据和正确的标签,然而,经常出现吵闹(错误)标签,使系统性能恶化。不幸的是,在这一领域的研究相对较少。在本文中,我们提出了一个在培训阶段逐步过滤吵闹标签的方法。我们将不同培训时代的网络预测与地面真实标签进行比较,并使用OR门机制(如逻辑路段中的OR门机制)选择可靠的(被认为是正确的)标签。因此,我们建议的方法被命名为ORGate。我们实验性地证明,ORGate可以有效地过滤噪音标签,并具有出色的性能。