This paper introduces a large-scale Korean speech dataset, called VOTE400, that can be used for analyzing and recognizing voices of the elderly people. The dataset includes about 300 hours of continuous dialog speech and 100 hours of read speech, both recorded by the elderly people aged 65 years or over. A preliminary experiment showed that speech recognition system trained with VOTE400 can outperform conventional systems in speech recognition of elderly people's voice. This work is a multi-organizational effort led by ETRI and MINDs Lab Inc. for the purpose of advancing the speech recognition performance of the elderly-care robots.
翻译:本文介绍名为VOTE400的大规模韩国语言数据集,可用于分析和承认老年人的声音,该数据集包括大约300小时连续对话讲话和100小时阅读演讲,两者均为65岁或65岁以上的老年人所记录。初步实验显示,接受VOTE400培训的语音识别系统在语音识别老年人的声音方面可以优于常规系统。这是一项由ENTRI和MIND实验室公司牵头的多组织工作,目的是提高老年人护理机器人的语音识别功能。