Preschool evaluation is crucial because it gives teachers and parents influential knowledge about a children's growth and development. The coronavirus pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is the ability to speak. Because of the differences between children's and adults' voices, employing Automatic Speech Recognition(ASR) systems is difficult since they are pre-trained on adults' voices. We constructed an ASR for our cognitive test system to solve this issue using the Wav2Vec 2.0 model with a new pre-training objective called Random Frequency Pitch(RFP). In addition, we used our new dataset to fine-tune our model for Meaningless Words(MW) and Rapid Automatic Naming(RAN) tests. Our new approach reaches a Word Error Rate(WER) of 6.45 on the Persian section of the CommonVoice dataset. Furthermore, our novel methodology produces positive outcomes in zero- and few-shot scenarios.
翻译:学前教育评估至关重要,因为它为教师和家长提供了有关儿童成长和发展的有影响力的知识。冠状病毒大流行凸显了对学龄前儿童进行在线评估的必要性。应该测试的领域之一是说话能力。由于儿童的声音与成年人的声音不同,采用自动语音识别系统是困难的,因为他们接受过成人声音的预先培训。我们用Wav2Vec 2.0模型为我们的认知测试系统建造了ASR,用于解决这个问题,该模型采用了新的培训前目标,即随机频率Pitch(RFP)。此外,我们利用我们的新数据集来微调我们的无意义单词和快速自动命名(RAN)测试模式。我们的新方法在通用语音数据集的波斯语部分达到了6.45的“WER”错误率。此外,我们的新方法在零和几发情景中产生了积极的结果。