The risk of Parkinson's disease (PD) is extremely serious, and PD speech recognition is an effective method of diagnosis nowadays. However, due to the influence of the disease stage, corpus, and other factors on data collection, the ability of every samples within one subject to reflect the status of PD vary. No samples are useless totally, and not samples are 100% perfect. This characteristic means that it is not suitable just to remove some samples or keep some samples. It is necessary to consider the sample transformation for obtaining high quality new samples. Unfortunately, existing PD speech recognition methods focus mainly on feature learning and classifier design rather than sample learning, and few methods consider the sample transformation. To solve the problem above, a PD speech sample transformation algorithm based on multitype reconstruction operators is proposed in this paper. The algorithm is divided into four major steps. Three types of reconstruction operators are designed in the algorithm: types A, B and C. Concerning the type A operator, the original dataset is directly reconstructed by designing a linear transformation to obtain the first dataset. The type B operator is designed for clustering and linear transformation of the dataset to obtain the second new dataset. The third operator, namely, the type C operator, reconstructs the dataset by clustering and convolution to obtain the third dataset. Finally, the base classifier is trained based on the three new datasets, and then the classification results are fused by decision weighting. In the experimental section, two representative PD speech datasets are used for verification. The results show that the proposed algorithm is effective. Compared with other algorithms, the proposed algorithm achieves apparent improvements in terms of classification accuracy.
翻译:Parkinson病(PD)的风险极其严重,PD语音识别是当今有效的诊断方法。然而,由于疾病阶段、体格和其他因素对数据收集的影响,每个受试者反映PD状态的能力各有不同。没有样品完全无用,而不是样品100%完美。这一特征意味着它不适宜仅仅删除某些样品或保存某些样品。有必要考虑样本转换,以获得高质量的新样本。不幸的是,现有的PD语音识别方法主要侧重于特征学习和分类设计,而不是抽样学习,而且很少有方法考虑样本的精度转换。为了解决上述问题,本文提出了基于多类型重建操作者的PD语音样本转换算法。算法分为四个主要步骤。三种类型的重建操作者是算法:A类、B类和C类。关于A类操作者,原始数据集通过设计线性转换直接重组,以获得第一个数据集。B类操作者设计了数据组合和线性转换数据配置方法,以获得第二个新数据的精度比重。本文中提出了基于多类型重建操作者数据的算法,第三个运算算出三个基数据序列的操作者将获得新的数据转换结果。在最后的计算中,操作者将获得新的数据转换数据。在C类中的计算中,第三个操作者将获得新的数据序列中,通过新的数据转换为新的数据转换数据。在最后的操作者将获得新的数据转换数据序列中,以新的数据转换数据。