This paper proposes a single-channel speech enhancement method to reduce the noise and enhance speech at low signal-to-noise ratio (SNR) levels and non-stationary noise conditions. Specifically, we focus on modeling the noise using a Gaussian mixture model (GMM) based on a multi-stage process with a parametric Wiener filter. The proposed noise model estimates a more accurate noise power spectral density (PSD), and allows for better generalization under various noise conditions compared to traditional Wiener filtering methods. Simulations show that the proposed approach can achieve better performance in terms of speech quality (PESQ) and intelligibility (STOI) at low SNR levels.
翻译:本文建议采用单一通道语音增强方法,在低信号对噪音比率和非静止噪音条件下减少噪音和增强语音,具体地说,我们侧重于使用高斯混合模型(GMM)进行噪音建模,该模型以多阶段过程为基础,配有准维纳过滤器,拟议的噪音模型估计出一种更准确的噪音功率光谱密度(PSD),并允许在各种噪音条件下比传统的Wiener过滤方法更普遍化。模拟显示,拟议的方法可以在低语音质量(PESQ)和智能性(STOI)方面实现更好的表现。