Speech enhancement methods based on deep learning have surpassed traditional methods. While many of these new approaches are operating on the wideband (16kHz) sample rate, a new fullband (48kHz) speech enhancement system is proposed in this paper. Compared to the existing fullband systems that utilizes perceptually motivated features to train the fullband speech enhancement using a single network structure, the proposed system is a two-step system ensuring good fullband speech enhancement quality while backward compatible to the existing wideband systems.
翻译:以深层学习为基础的语音增强方法已经超越了传统方法,虽然许多这些新办法都采用宽频(16kHz)样本率,但本文件提议采用新的全频(48kHz)语音增强系统,与现有的全频系统相比,该系统利用感知驱动功能,利用单一网络结构来培训全频语音增强,拟议系统是一个两步制,确保全频语音提高质量良好,与现有宽频系统向后兼容。