Preschool evaluation is crucial because it gives teachers and parents influential knowledge about children's growth and development. The COVID-19 pandemic has highlighted the necessity of online assessment for preschool children. One of the areas that should be tested is their ability to speak. Employing an Automatic Speech Recognition(ASR) system is useless since they are pre-trained on voices that are different from children's voices in terms of frequency and amplitude. We constructed an ASR for our cognitive test system to solve this issue using the Wav2Vec 2.0 model with a new pre-training objective called Random Frequency Pitch(RFP). In addition, we used our new dataset to fine-tune our model for Meaningless Words(MW) and Rapid Automatic Naming(RAN) tests. Our new approach reaches a Word Error Rate(WER) of 6.45 on the Persian section of the CommonVoice dataset. Furthermore, our novel methodology produces positive outcomes in zero- and few-shot scenarios.
翻译:学前教育评估至关重要,因为它使教师和家长对儿童成长和发育具有影响力的知识。COVID-19大流行突出了对学龄前儿童进行在线评估的必要性。应该测试的领域之一是他们说话的能力。使用自动语音识别系统是毫无用处的,因为他们接受过与儿童声音在频率和振幅方面的不同声音的预先培训。我们为我们的认知测试系统建造了ASR,以使用Wav2Vec 2.0模式和新的培训前目标,即随机频率Pitch(RFP)解决这个问题。此外,我们使用我们的新数据集来微调我们无意义词和快速自动命名(RAN)测试的模型。我们的新方法在通用语音数据集的波斯语部分达到了6.45的“WER”值。此外,我们的新方法在零和微弱的情景中产生了积极的结果。