We propose a novel approach for semi-supervised learning (SSL) designed to overcome distribution shifts between training and real-world data arising in the keyword spotting (KWS) task. Shifts from training data distribution are a key challenge for real-world KWS tasks: when a new model is deployed on device, the gating of the accepted data undergoes a shift in distribution, making the problem of timely updates via subsequent deployments hard. Despite the shift, we assume that the marginal distributions on labels do not change. We utilize a modified teacher/student training framework, where labeled training data is augmented with unlabeled data. Note that the teacher does not have access to the new distribution as well. To train effectively with a mix of human and teacher labeled data, we develop a teacher labeling strategy based on confidence heuristics to reduce entropy on the label distribution from the teacher model; the data is then sampled to match the marginal distribution on the labels. Large scale experimental results show that a convolutional neural network (CNN) trained on far-field audio, and evaluated on far-field audio drawn from a different distribution, obtains a 14.3% relative improvement in false discovery rate (FDR) at equal false reject rate (FRR), while yielding a 5% improvement in FDR under no distribution shift. Under a more severe distribution shift from far-field to near-field audio with a smaller fully connected network (FCN) our approach achieves a 52% relative improvement in FDR at equal FRR, while yielding a 20% relative improvement in FDR on the original distribution.
翻译:我们提出了半监督学习的新办法(SSL),旨在克服培训与关键字识别(KWS)任务产生的真实世界数据之间的分布变化。培训数据分配的转变是真实世界 KWS任务的关键挑战:当在设备上部署新模型时,所接受数据的标签在分布上会发生变化,使通过随后部署及时更新的问题变得困难。尽管有了这一转变,我们假设标签上的边际分布不会改变。我们使用一个经过修改的教师/学生培训框架,在这个框架中,标签的培训数据用未贴标签的数据来扩充。注意到教师无法获取新的分布;培训数据分配是一个关键挑战:当在设备上部署新模型时,所接受的数据的标签标签在分布上会发生变化,使通过随后的部署来及时更新的问题变得困难。尽管如此转变,我们假设标签上的边际分布不会改变。大规模实验结果显示,在远方位音频(CN)的升级网络中,在远方域音频(FRRDR)上进行更小规模的升级改进,在远处的流流流流率下,在远处进行更低频流流流流的递后,在FRRRDRDR中进行更低的销售率下,在14次下进行虚假的升级的改善。