In this paper, we present specially designed automatic speech recognition (ASR) systems for the highly agglutinative and inflective languages of Tamil and Kannada that can recognize unlimited vocabulary of words. We use subwords as the basic lexical units for recognition and construct subword grammar weighted finite state transducer (SG-WFST) graphs for word segmentation that captures most of the complex word formation rules of the languages. We have identified the following category of words (i) verbs, (ii) nouns, (ii) pronouns, and (iv) numbers. The prefix, infix and suffix lists of subwords are created for each of these categories and are used to design the SG-WFST graphs. We also present a heuristic segmentation algorithm that can even segment exceptional words that do not follow the rules encapsulated in the SG-WFST graph. Most of the data-driven subword dictionary creation algorithms are computation driven, and hence do not guarantee morpheme-like units and so we have used the linguistic knowledge of the languages and manually created the subword dictionaries and the graphs. Finally, we train a deep neural network acoustic model and combine it with the pronunciation lexicon of the subword dictionary and the SG-WFST graph to build the subword-ASR systems. Since the subword-ASR produces subword sequences as output for a given test speech, we post-process its output to get the final word sequence, so that the actual number of words that can be recognized is much higher. Upon experimenting the subword-ASR system with the IISc-MILE Tamil and Kannada ASR corpora, we observe an absolute word error rate reduction of 12.39% and 13.56% over the baseline word-based ASR systems for Tamil and Kannada, respectively.
翻译:在本文件中,我们为泰米尔语和卡纳达语的高度混凝土和反动语言专门设计了自动语音识别系统(ASR),这些语言可以识别无限制的字词。我们使用子字作为基本词典单位,用于识别,并构建用于字形分解的子词语法加权定数(SG-WFST)图形(SG-WFST),以记录语言中大多数复杂的单词形成规则。我们确定了以下几类词 (一) 动词, (二) 名词, (二) 代词, (二) 代词, (四) 数字。前缀, 和后缀。我们为这些类别中的每一类别创建了子字词词典的预言, 预言和后缀列表的后缀列表 。我们用S-IAR-I 预言序做了一个语言和后程系统(SGISB) 的亚曲解算法。我们用SLOVA 和后程的亚行系统, 将一个语言和亚数的亚数整合系统,我们用SLIS-IL 的亚变码系统,我们用了SLA-Ralalalalalalalalalalxalalal com commal commal commal commal