This paper introduces a single-step time-domain method named HnH-NRSE, which is designed to improve speech intelligibility and quality simultaneously under noisy-reverberant conditions. In this solution, the harmonic and non-harmonic elements of speech are separated by applying zero-crossing and energy criteria. An objective evaluation of the degree of non-stationarity is further used to set an adaptive gain that treats masking components. The technique requires no prior knowledge of speech statistics or room information. Additionally, two combined solutions, IRMO and IRMN, are proposed as composite methods for improving noisy-reverberant speech signals. The proposed and baseline methods are evaluated with two intelligibility measures and three quality measures used for objective prediction. The results show that the proposed scheme yields higher intelligibility and quality improvement than competing methods in most scenarios. A perceptual intelligibility listening test is also performed, and it corroborates these results. Furthermore, the proposed HnH-NRSE solution attains SRMR quality scores similar to those of the composite IRMO and IRMN techniques.
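The zero-crossing and energy criteria mentioned above can be illustrated with a minimal sketch. It is not the HnH-NRSE algorithm itself: the abstract does not give the decision rule, so the function names, the frame length, and both thresholds (`zcr_max`, `energy_min`) are assumptions chosen only for illustration. The sketch labels a frame as harmonic when its zero-crossing rate is low and its energy is high, a common heuristic for voiced speech.

```python
import math


def frame_features(frame):
    """Return (zero-crossing rate, mean energy) for one frame of samples."""
    # Count sign changes between consecutive samples.
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    zcr = crossings / (len(frame) - 1)
    energy = sum(x * x for x in frame) / len(frame)
    return zcr, energy


def label_frames(signal, frame_len=160, zcr_max=0.25, energy_min=0.01):
    """Label each non-overlapping frame 'harmonic' or 'non-harmonic'.

    frame_len, zcr_max and energy_min are hypothetical values,
    not parameters taken from the paper.
    """
    labels = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        zcr, energy = frame_features(signal[start:start + frame_len])
        # Assumed rule: few zero crossings and enough energy -> harmonic.
        harmonic = zcr <= zcr_max and energy >= energy_min
        labels.append("harmonic" if harmonic else "non-harmonic")
    return labels


# Usage: a 200 Hz tone sampled at 8 kHz, a rapidly alternating
# noise-like frame, and a silent frame.
tone = [0.5 * math.sin(2 * math.pi * 200 * n / 8000) for n in range(160)]
hiss = [0.3 if n % 2 == 0 else -0.3 for n in range(160)]
silence = [0.0] * 160
print(label_frames(tone + hiss + silence))
```

In this sketch the tone frame passes both tests, while the alternating frame fails the zero-crossing test and the silent frame fails the energy test. A practical system would likely use overlapping frames and thresholds adapted to the signal level.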