As one of the most prevalent neurodegenerative disorders, Parkinson's disease (PD) has a significant impact on the fine motor skills of patients. The complex interplay of different articulators during speech production and realization of required muscle tension become increasingly difficult, thus leading to a dysarthric speech. Characteristic patterns such as vowel instability, slurred pronunciation and slow speech can often be observed in the affected individuals and were analyzed in previous studies to determine the presence and progression of PD. In this work, we used a phonetic recognizer trained exclusively on healthy speech data to investigate how PD affected the phonetic footprint of patients. We rediscovered numerous patterns that had been described in previous contributions although our system had never seen any pathological speech previously. Furthermore, we could show that intermediate activations from the neural network could serve as feature vectors encoding information related to the disease state of individuals. We were also able to directly correlate the expert-rated intelligibility of a speaker with the mean confidence of phonetic predictions. Our results support the assumption that pathological data is not necessarily required to train systems that are capable of analyzing PD speech.
翻译:帕金森氏病(PD)是最常见的神经退化性疾病之一,对病人的精巧运动技能有重大影响。不同的动脉在语音制作和达到必要的肌肉紧张状态期间的复杂相互作用越来越困难,导致出现一种狂躁的言辞。在受影响的人中,经常可以观察到元音不稳定、发音模糊和发音缓慢等典型模式,并在以前的研究中加以分析,以确定PD的存在和进展。在这项工作中,我们使用专门受过健康言语数据培训的语音识别器来调查PD如何影响病人的语音足迹。我们重新发现了以前在贡献中描述的许多模式,尽管我们系统以前从未见过任何病态的言语。此外,我们可以证明神经网络的中间作用可以作为特征矢量,将与个人疾病状态有关的信息编码起来。我们还能够直接将一名发言人专家的智能与语音预测的低可信度联系起来。我们的结果支持这样的假设,即病理学数据不一定需要用于培训能够分析PD语音系统的系统。