Cough is a common symptom of respiratory and lung diseases. Cough detection is important to prevent, assess and control epidemic, such as COVID-19. This paper proposes a model to detect cough events from cough audio signals. The models are trained by the dataset combined ESC-50 dataset with self-recorded cough recordings. The test dataset contains inpatient cough recordings collected from inpatients of the respiratory disease department in Ruijin Hospital. We totally build 15 cough detection models based on different feature numbers selected by Random Frog, Uninformative Variable Elimination (UVE), and Variable influence on projection (VIP) algorithms respectively. The optimal model is based on 20 features selected from Mel Frequency Cepstral Coefficients (MFCC) features by UVE algorithm and classified with Support Vector Machine (SVM) linear two-class classifier. The best cough detection model realizes the accuracy, recall, precision and F1-score with 94.9%, 97.1%, 93.1% and 0.95 respectively. Its excellent performance with fewer dimensionality of the feature vector shows the potential of being applied to mobile devices, such as smartphones, thus making cough detection remote and non-contact.
翻译:咳嗽是呼吸道和肺病的一种常见症状。 咳嗽检测对于预防、评估和控制传染病非常重要, 如COVID-19。 本文提出了一个从咳嗽声信号中检测咳嗽事件的模式。 模型由数据集结合ESC- 50数据集和自我记录的咳嗽记录进行培训。 测试数据集包含从鲁伊津医院呼吸道疾病部的住院病人那里收集的住院咳嗽记录。 我们完全根据随机蛙、 不明变量消除(UVE)和变量对投影算法(VIP)的不同特征数字建立了15种咳嗽检测模型。 最佳模型基于从Mel Right Cepstraal Covalies(MCC)特征中选定的20种特征,由UVE算法和辅助病媒机器(SVM)线性双级分类。 最佳咳嗽检测模型的准确性、 、 精确性和 F1 - 核心值分别为94.9%、 97.1%、 93.1% 和 0.95 。 特征病媒的精度表现显示其应用到移动设备的潜力, 如智能手机, 进行远程检测。