Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing Automatic Speech Recognition(ASR) system is useless, since they of pre-trained on voices, that are different from children's voices in terms of frequency and amplitude. We constructed an ASR for our cognitive test system to solve this issue using the Wav2Vec 2.0 model with a new pre-training objective called Random Frequency Pitch(RFP). In addition, we used our new dataset to fine-tune our model for Meaningless Words(MW) and Rapid Automatic Naming(RAN) tests. Our new approach reaches a Word Error Rate(WER) of 6.45 on the Persian section of the CommonVoice dataset. Furthermore, our novel methodology produces positive outcomes in zero- and few-shot scenarios.
翻译:学前教育评估至关重要,因为它使教师和家长对儿童成长和发展具有影响力的知识。COVID-19大流行突出了对学龄前儿童进行在线评估的必要性。应该测试的领域之一是他们说话的能力。使用自动语音识别系统是毫无用处的,因为他们在声音上受过预先训练,在频率和振幅方面与儿童的声音不同。我们为我们的认知测试系统建造了ASR,用Wav2Vec 2.0模式和新的培训前目标“随机频率小便”解决了这个问题。此外,我们利用我们的新数据集微调了我们关于无意义字词和快速自动命名的测试模式。我们的新方法在通用语音数据集的波斯部分达到了6.45的“WER”值。此外,我们的新方法在零和少见的情景中产生了积极的结果。