Continual Learning for Monolingual End-to-End Automatic Speech Recognition

Adapting Automatic Speech Recognition (ASR) models to new domains degrades their performance on the original domain(s), a phenomenon called Catastrophic Forgetting (CF). Even monolingual ASR models cannot be extended to new accents, dialects, topics, etc. without suffering from CF, so they cannot be continually improved unless all past data is stored. Fortunately, Continual Learning (CL) methods, which aim to enable continual adaptation while overcoming CF, can be used. In this paper, we implement a large number of CL methods for End-to-End ASR and test and compare their ability to extend a monolingual Hybrid CTC-Transformer model across four new tasks. We find that the best-performing CL method closes the gap between the fine-tuned model (lower bound) and the model trained jointly on all tasks (upper bound) by more than 40%, while requiring access to only 0.6% of the original data.
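
The "closes the gap by more than 40%" figure can be read as a normalized improvement between the two bounds. The abstract does not spell out the exact formula, so the following is a minimal sketch under the assumption that the gap is measured in word error rate (WER) and normalized linearly; the function name `gap_closed` and all WER values are hypothetical and are not taken from the paper.

```python
def gap_closed(wer_finetuned: float, wer_joint: float, wer_method: float) -> float:
    """Fraction of the gap between fine-tuning and joint training recovered by a CL method.

    Assumption: lower WER is better, the fine-tuned model is the lower bound
    (highest WER), and the jointly trained model is the upper bound (lowest WER).
    """
    gap = wer_finetuned - wer_joint
    if gap <= 0:
        raise ValueError("expected the fine-tuned WER to exceed the joint-training WER")
    return (wer_finetuned - wer_method) / gap


if __name__ == "__main__":
    # Hypothetical average WERs (in percent) over all tasks, for illustration only.
    fraction = gap_closed(wer_finetuned=30.0, wer_joint=20.0, wer_method=25.5)
    print(f"gap closed: {fraction:.0%}")
```

Under this reading, a value of 0 means the method is no better than plain fine-tuning, and a value of 1 means it matches joint training on all tasks.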
