As a category of transfer learning, domain adaptation plays an important role in generalizing the model trained in one task and applying it to other similar tasks or settings. In speech enhancement, a well-trained acoustic model can be exploited to obtain the speech signal in the context of other languages, speakers, and environments. Recent domain adaptation research was developed more effectively with various neural networks and high-level abstract features. However, the related studies are more likely to transfer the well-trained model from a rich and more diverse domain to a limited and similar domain. Therefore, in this study, the domain adaptation method is proposed in unsupervised speech enhancement for the opposite circumstance that transferring to a larger and richer domain. On the one hand, the importance-weighting (IW) approach is exploited with a variance constrained autoencoder to reduce the shift of shared weights between the source and target domains. On the other hand, in order to train the classifier with the worst-case weights and minimize the risk, the minimax method is proposed. Both the proposed IW and minimax methods are evaluated from the VOICE BANK and IEEE datasets to the TIMIT dataset. The experiment results show that the proposed methods outperform the state-of-the-art approaches.
翻译:作为转让学习的类别,领域适应在推广在一项任务中经过培训的模式并将其应用于其他类似任务或设置方面起着重要作用。在增强语言能力方面,可以利用训练有素的声学模型获得其他语言、讲者和环境的语音信号。最近,利用各种神经网络和高层次抽象特征,更有效地开发了领域适应研究。然而,相关研究更有可能将训练有素的模式从丰富和更加多样化的领域转移到有限和类似的领域。因此,在本研究中,在向较大和更加丰富的领域转移的相反情况下,在不受监督的语音增强中提出了域适应方法。一方面,在利用重要性加权(IW)方法时,采用了差异限制的自动编码器,以减少源和目标领域之间共享权重的转移。另一方面,为了培训精密的分类器,将风险降到最低程度。拟议的IW和迷你式方法都是从VOICEANK和IEEE 数据集到试验状态的方法。拟议的实验方法显示了结果,展示了测试方法。