Intent Recognition and Slot Identification are crucial components in spoken language understanding (SLU) systems. In this paper, we present a novel approach towards both these tasks in the context of low resourced and unwritten languages. We present an acoustic based SLU system that converts speech to its phonetic transcription using a universal phone recognition system. We build a word-free natural language understanding module that does intent recognition and slot identification from these phonetic transcription. Our proposed SLU system performs competitively for resource rich scenarios and significantly outperforms existing approaches as the amount of available data reduces. We observe more than 10% improvement for intent classification in Tamil and more than 5% improvement for intent classification in Sinhala. We also present a novel approach towards unsupervised slot identification using normalized attention scores. This approach can be used for unsupervised slot labelling, data augmentation and to generate data for a new slot in a one-shot way with only one speech recording
翻译:在本文中,我们提出了一种在资源贫乏和不成文语言背景下处理这两项任务的新颖方法。我们提出了一个基于声音的SLU系统,利用通用电话识别系统将语音转换成语音转录;我们建立了一个无字自然语言理解模块,该模块的用意是识别和从这些语音转录中确定空档。我们提议的SLU系统在资源丰富情景中具有竞争力,并随着可用数据数量的减少,大大优于现有方法。我们看到泰米尔语意向分类改进了10%以上,僧伽罗语意向分类改进了5%以上。我们还提出了一个使用正常注意分数将语音转换成语音转录的无监控槽识别的新方法。这个方法可用于无监控的槽标签、数据增强和以一发式方式生成新槽的数据,只有一个语音记录。