In this paper, we use pre-trained ResNet models as backbone architectures for classification of adventitious lung sounds and respiratory diseases. The knowledge of the pre-trained model is transferred by using vanilla fine-tuning, co-tuning, stochastic normalization and the combination of the co-tuning and stochastic normalization techniques. Furthermore, data augmentation in both time domain and time-frequency domain is used to account for the class imbalance of the ICBHI and our multi-channel lung sound dataset. Additionally, we apply spectrum correction to consider the variations of the recording device properties on the ICBHI dataset. Empirically, our proposed systems mostly outperform all state-of-the-art lung sound classification systems for the adventitious lung sounds and respiratory diseases of both datasets.
翻译:在本文中,我们使用经过预先训练的ResNet模型,作为对突发性肺声和呼吸道疾病进行分类的骨干结构;通过使用香草微调、联合调试、随机正常化以及联合调试和随机调试的混合技术,传授对预先训练的模型的知识;此外,用时间域和时频域的数据增加来计算ICBHI和我们多通道肺声数据集的分类不平衡。此外,我们应用频谱校正来考虑ICBHI数据集上记录装置特性的变化。