Loanwords, such as Anglicisms, are a challenge in German speech recognition. Due to their irregular pronunciation compared to native German words, automatically generated pronunciation dictionaries often include faulty phoneme sequences for Anglicisms. In this work, we propose a multitask sequence-to-sequence approach for grapheme-to-phoneme conversion to improve the phonetization of Anglicisms. We extended a grapheme-to-phoneme model with a classifier to distinguish Anglicisms from native German words. With this approach, the model learns to generate pronunciations differently depending on the classification result. We used our model to create supplementary Anglicism pronunciation dictionaries that are added to an existing German speech recognition model. Tested on a dedicated Anglicism evaluation set, we improved the recognition of Anglicisms compared to a baseline model, reducing the word error rate by 1 % and the Anglicism error rate by 3 %. We show that multitask learning can help solving the challenge of loanwords in German speech recognition.
翻译:诸如Anglicism 等名词是德国语音识别中的一项挑战。 由于这些名词与本土德语词相比的发音不规则,自动生成的发音词典往往包括安格利西主义的错误的发音序列。 在这项工作中,我们提议了一种多任务序列到顺序的语法转换方法,用于图形化对电话的转换,以改进安格利西主义的语音化。我们扩展了一个配有分级器的图形化到语音模型,以区分安格利西主义与本土德语的词。通过这种方法,该模型学会了根据分类结果的不同生成发音。我们使用我们的模型创建了补充的安格利西主义发音词典,这些词典被添加到现有的德国语音识别模型中。我们试验了一个专门的安格利西主义评价集,我们改进了对安格主义的认知,比一个基线模型,将单词错误率降低了1%,安格利西主义错误率降低了3%。我们显示多任务学习可以帮助解决德语语音识别中的借词的挑战。