Subjective evaluation results for two low-latency deep neural networks (DNN) are compared to a matured version of a traditional Wiener-filter based noise suppressor. The target use-case is real-world single-channel speech enhancement applications, e.g., communications. Real-world recordings consisting of additive stationary and non-stationary noise types are included. The evaluation is divided into four outcomes: speech quality, noise transparency, speech intelligibility or listening effort, and noise level w.r.t. speech. It is shown that DNNs improve noise suppression in all conditions in comparison to the traditional Wiener-filter baseline without major degradation in speech quality and noise transparency while maintaining speech intelligibility better than the baseline.
翻译:将两个低纬度深神经网络(DNN)的主观评价结果与基于传统的Wiener-过滤器抑制噪音的成熟版本比较,目标使用情况是实际的单一频道增强语音应用,例如通信;包括由静态和非静态噪音类添加剂组成的真实世界记录;评价分为四个结果:语言质量、噪音透明度、语音智能或监听努力,以及噪音水平(w.r.t.)演讲。