Masked language models have revolutionized natural language processing systems in the past few years. A recently introduced generalization of masked language models called warped language models are trained to be more robust to the types of errors that appear in automatic or manual transcriptions of spoken language by exposing the language model to the same types of errors during training. In this work we propose a novel approach that takes advantage of the robustness of warped language models to transcription noise for correcting transcriptions of spoken language. We show that our proposed approach is able to achieve up to 10% reduction in word error rates of both automatic and manual transcriptions of spoken language.
翻译:过去几年来,蒙面语言模式使自然语言处理系统发生了革命性的变化,最近引进了口语模式,称为扭曲语言模式的口语模式,通过在培训期间将语言模式暴露在相同类型的错误中,对口语自动或人工抄录中出现的错误类型进行培训,使其更加稳健。在这项工作中,我们提出一种新颖的办法,利用扭曲语言模式的稳健性,将抄录噪音用于纠正口语的抄录。我们表明,我们提议的办法能够将口语自动和人工抄录的字差率降低10%。