The paper copes with the task of automatic assessment of second language proficiency from the language learners' spoken responses to test prompts. The task has significant relevance to the field of computer assisted language learning. The approach presented in the paper relies on two separate modules: (1) an automatic speech recognition system that yields text transcripts of the spoken interactions involved, and (2) a multiple classifier system based on deep learners that ranks the transcripts into proficiency classes. Different deep neural network architectures (both feed-forward and recurrent) are specialized over diverse representations of the texts in terms of: a reference grammar, the outcome of probabilistic language models, several word embeddings, and two bag-of-word models. Combination of the individual classifiers is realized either via a probabilistic pseudo-joint model, or via a neural mixture of experts. Using the data of the third Spoken CALL Shared Task challenge, the highest values to date were obtained in terms of three popular evaluation metrics.
翻译:文件涉及从语言学习者的口语回答中自动评估第二语言熟练程度的任务。任务与计算机辅助语言学习领域密切相关。文件中提出的方法依靠两个不同的模块:(1) 自动语音识别系统,生成相关口语互动的文字记录;(2) 由深层学习者组成的多分类系统,将笔录分为熟练程度班级。不同的深层神经网络结构(即反馈前和经常性)在以下几个方面对文本的不同表述具有专门性:参考语法、概率语言模型的结果、几个字嵌入式和两个词包模式。单个分类者通过概率化假联模式或通过专家的神经混合组合实现合并。利用第三次Spoken Call共享任务挑战的数据,以三种流行评价指标获得迄今为止的最高值。