This paper analyzes dysarthric speech datasets from three languages with different prosodic systems: English, Korean, and Tamil. We inspect 39 acoustic measurements which reflect three speech dimensions including voice quality, pronunciation, and prosody. As multilingual analysis, examination on the mean values of acoustic measurements by intelligibility levels is conducted. Further, automatic intelligibility classification is performed to scrutinize the optimal feature set by languages. Analyses suggest pronunciation features, such as Percentage of Correct Consonants, Percentage of Correct Vowels, and Percentage of Correct Phonemes to be language-independent measurements. Voice quality and prosody features, however, generally present different aspects by languages. Experimental results additionally show that different speech dimension play a greater role for different languages: prosody for English, pronunciation for Korean, both prosody and pronunciation for Tamil. This paper contributes to speech pathology in that it differentiates between language-independent and language-dependent measurements in intelligibility classification for English, Korean, and Tamil dysarthric speech.
翻译:本文分析了三种语言的读写语音数据集:英语、韩语和泰米尔语。我们检查了39个反映三个语言层面的声学测量,包括声音质量、发音和口音。作为多语种分析,还按智能水平对声学测量的平均值进行了研究。此外,还进行了自动智能分类,以仔细检查语言设定的最佳特征。分析显示读音特征,如正确调子的百分比、正确调音的百分比和正确调音的百分比,以语言为独立计量。但声音质量和动词特征通常以语言呈现不同方面的情况。实验结果还表明,不同语言的语音层面作用更大:英语的读写、韩语的读音、泰米尔语的读写和发音。本文有助于语言病学,因为它区分了英语、韩语和泰米尔语调调等语言依赖性和依赖语言的测量。