Background: An early diagnosis together with an accurate disease progression monitoring of multiple sclerosis is an important component of successful disease management. Prior studies have established that multiple sclerosis is correlated with speech discrepancies. Early research using objective acoustic measurements has discovered measurable dysarthria. Objective: To determine the potential clinical utility of machine learning and deep learning/AI approaches for the aiding of diagnosis, biomarker extraction and progression monitoring of multiple sclerosis using speech recordings. Methods: A corpus of 65 MS-positive and 66 healthy individuals reading the same text aloud was used for targeted acoustic feature extraction utilizing automatic phoneme segmentation. A series of binary classification models was trained, tuned, and evaluated regarding their Accuracy and area-under-curve. Results: The Random Forest model performed best, achieving an Accuracy of 0.82 on the validation dataset and an area-under-curve of 0.76 across 5 k-fold cycles on the training dataset. 5 out of 7 acoustic features were statistically significant. Conclusion: Machine learning and artificial intelligence in automatic analyses of voice recordings for aiding MS diagnosis and progression tracking seems promising. Further clinical validation of these methods and their mapping onto multiple sclerosis progression is needed, as well as a validating utility for English-speaking populations.
翻译:早期诊断和对多发性硬化症进行准确的疾病进展监测是成功进行疾病管理的重要组成部分。先前的研究已经证实,多发性硬化与言词差异有关。利用客观的声学测量进行早期研究发现了可测量的剧变。目标:确定机器学习和深学习/AI方法在临床的潜在效用,以及利用语音记录对多发性硬化症进行诊断、生物标志提取和进展监测的深度学习/AI方法。方法:65个MS-阳性和66个健康的人读了同一文本的呼声,用于利用自动电话路段进行有目标的声谱提取。一系列二元分类模型经过培训、调整和评价,其准确性和地区偏差。结果:随机森林模型表现最佳,在验证数据集上实现了0.82的准确性,在培训数据集的5公里周期内实现了0.76的准确性。在7个音响特征中有5个具有统计意义。结论:在自动分析语音记录用于协助MS诊断和不断跟踪的语音记录方面,机器学习和人工智能分类模型,进一步验证这些方法的实用性进展,作为有效的英国人口需要的临床诊断和临床诊断。