Automatic Speech Recognition (ASR) is an active field of research due to its huge number of applications and the proliferation of interfaces or computing devices that can support speech processing. But the bulk of applications is based on well-resourced languages that overshadow under-resourced ones. Yet ASR represents an undeniable mean to promote such languages, especially when design human-to-human or human-to-machine systems involving illiterate people. An approach to design an ASR system targeting under-resourced languages is to start with a limited vocabulary. ASR using a limited vocabulary is a subset of the speech recognition problem that focuses on the recognition of a small number of words or sentences. This paper aims to provide a comprehensive view of mechanisms behind ASR systems as well as techniques, tools, projects, recent contributions, and possibly future directions in ASR using a limited vocabulary. This work consequently provides a way to go when designing ASR system using limited vocabulary. Although an emphasis is put on limited vocabulary, most of the tools and techniques reported in this survey applied to ASR systems in general.
翻译:自动语音识别(ASR)是一个积极的研究领域,因为其应用数量巨大,而且可以支持语音处理的界面或计算装置激增,但大部分应用都基于资源充足的语言,使资源不足的语言蒙上阴影;然而,自动语音识别是推广这些语言的一个不可否认的手段,特别是在设计涉及文盲的人力到人类或人力到机器系统时;设计针对资源不足语言的自动语音识别系统的方法是以有限的词汇开始的;使用有限的词汇是语音识别问题的一个部分,侧重于承认少数词或句子;本文旨在全面介绍ASR系统背后的机制,以及技术、工具、项目、最近的贡献,以及可能使用有限的词汇的ASR系统未来方向;因此,这项工作为使用有限的词汇设计ASR系统提供了一条途径;尽管本次调查中报告的大多数工具和技术都用于一般的ASR系统,但重点是有限的词汇。