Accurate and early prediction of a disease allows to plan and improve a patient's quality of future life. During pandemic situations, the medical decision becomes a speed challenge in which physicians have to act fast to diagnose and predict the risk of the severity of the disease, moreover this is also of high priority for neurodegenerative diseases like Parkinson's disease. Machine Learning (ML) models with Features Selection (FS) techniques can be applied to help physicians to quickly diagnose a disease. FS optimally subset features that improve a model performance and help reduce the number of needed tests for a patient and hence speeding up the diagnosis. This study shows the result of three Feature Selection (FS) techniques pre-applied to a classifier algorithm, Logistic Regression, on non-invasive test results data. The three FS are Analysis of Variance (ANOVA) as filter based method, Least Absolute Shrinkage and Selection Operator (LASSO) as embedded method and Sequential Feature Selection (SFS) as wrapper method. The outcome shows that FS technique can help to build an efficient and effective classifier, hence improving the performance of the classifier while reducing the computation time.
翻译:对疾病的准确和早期预测有助于规划和改善患者未来生活质量。在大流行病情况下,医疗决定成为一项快速挑战,医生必须迅速采取行动,诊断和预测疾病严重性的风险,这也是帕金森病等神经退化性疾病的高度优先事项。具有特征选择(FS)技术的机器学习模型可以用于帮助医生快速诊断疾病。FS最优化的子集功能,可以改进模型性能,帮助减少患者所需的检测数量,从而加快诊断速度。这项研究显示三种特征选择技术(FS)预先应用到分类算法、物流倒退、非侵入性测试结果数据的结果。三种FS是作为过滤法分析差异(ANOVA),最小绝对缩小和选择操作员(LASSO)作为嵌入方法和序列性选择方法,作为包装方法。结果显示FS技术可以帮助构建一个高效和高效的分类,从而改进分类方法的性能。