In this work, the issue of Parkinson's disease (PD) diagnostics using non-invasive antemortem techniques was tackled. A deep learning approach for classification of raw speech recordings in patients with diagnosed PD was proposed. The core of proposed method is an audio classifier using knowledge transfer from a pretrained natural language model, namely \textit{wav2vec 2.0}. Method was tested on a group of 38 PD patients and 10 healthy persons above the age of 50. A dataset of speech recordings acquired using a smartphone recorder was constructed and the recordings were label as PD/non-PD with severity of the disease additionally rated using Hoehn-Yahr scale. The audio recordings were cut into 2141 samples that include sentences, syllables, vowels and sustained phonation. The classifier scores up to 97.92\% of cross-validated accuracy. Additionally, paper presents results of a human-level performance assessment questionnaire, which was consulted with the neurology professionals
翻译:在这项工作中,解决了使用非侵入性死前技术对帕金森氏病(PD)进行诊断的问题;提出了对诊断为PD的病人的原始语音录音进行分类的深层次学习方法;拟议方法的核心是使用预先培训的自然语言模式,即\ textit{wav2vec2.0}的知识转让的音频分类器;对38名PD病人和10名50岁以上健康人士进行了方法测试;制作了使用智能电话记录器获得的语音录音数据集,录音被标为PD/非PD,其病情严重程度按Hoehn-Yahr等级被进一步评级;录音被切成2 141个样本,其中包括判决、音频、元音和持续流行;分类者得分数达到97.92%的交叉有效准确度;此外,文件还介绍了与神经学专业人员协商的人类级别业绩评估问卷的结果。