An initial real-time speech enhancement method is presented to reduce the effects of additive noise. The method operates in the frequency domain and is a form of spectral subtraction. Initially, minimum statistics are used to generate an estimate of the noise signal in the frequency domain. The use of minimum statistics avoids the need for a voice activity detector (VAD) which has proven to be challenging to create. As minimum statistics are used, the noise signal estimate must be multiplied by a scaling factor before subtraction from the noise corrupted speech signal can take place. A spectral floor is applied to the difference to suppress the effects of "musical noise". Finally, a series of further enhancements are considered to reduce the effects of residual noise even further. These methods are compared using time-frequency plots to create the final speech enhancement design
翻译:首次实时语音增强方法用于减少添加性噪音的影响。该方法在频率域内运作,是一种光谱减色的形式。最初,使用最低统计数据对频率域的噪音信号作出估计。使用最低统计数据避免了对声音活动探测器(VAD)的需求,而事实证明该探测器的创建具有挑战性。在使用最低统计数据时,噪音信号估计值必须乘以一个缩放系数,然后才能从噪音腐蚀性语音信号中减去。对“音乐噪音”的影响进行了光谱层的区分。最后,还考虑进行一系列进一步的改进,以进一步降低残余噪音的影响。这些方法用时间频谱图进行比较,以创建最后的语音增强设计。