This paper introduces the Unbeatable Team's submission to the ICASSP 2023 Deep Noise Suppression (DNS) Challenge. We expand our previous work, TEA-PSE, to its upgraded version -- TEA-PSE 3.0. Specifically, TEA-PSE 3.0 incorporates a residual LSTM after squeezed temporal convolution network (S-TCN) to enhance sequence modeling capabilities. Additionally, the local-global representation (LGR) structure is introduced to boost speaker information extraction, and multi-STFT resolution loss is used to effectively capture the time-frequency characteristics of the speech signals. Moreover, retraining methods are employed based on the freeze training strategy to fine-tune the system. According to the official results, TEA-PSE 3.0 ranks 1st in both ICASSP 2023 DNS-Challenge track 1 and track 2.
翻译:本文介绍了不可战胜小组向ICASSP 2023深海噪音抑制挑战提交的呈文。我们把我们以前的工作TEA-PSE扩大到升级版TEA-PSE 3.0。具体地说,TEA-PSE 3.0在挤压的时变网络(S-TCN)后纳入了残留的LSTM,以加强测序建模能力。此外,还引入了地方-全球代表结构,以促进语音信息提取,并使用多端端方位电话解析损失来有效捕捉语音信号的时间频率特征。此外,还根据冻结培训战略采用了再培训方法,对系统进行微调。根据官方结果,TEA-PSE 3.0在ICASSP 2023 DNS-Challenge轨道1和轨道2中都名列第1位。</s>