This paper introduces "speechocean762", a new open-source speech corpus designed for pronunciation assessment. It consists of 5000 English utterances from 250 non-native speakers, half of whom are children. Five experts annotated each utterance at the sentence, word, and phoneme levels. An open-source baseline system is released to illustrate the phoneme-level pronunciation assessment workflow on this corpus. The corpus may be used freely for both commercial and non-commercial purposes. It is available as a free download from OpenSLR, and the corresponding baseline system is published in the Kaldi speech recognition toolkit.