Packet loss concealment (PLC) is a tool for enhancing speech degradation caused by poor network conditions or underflow/overflow in audio processing pipelines. We propose a real-time recurrent method that leverages previous outputs to mitigate artefact of lost packets without the prior knowledge of loss mask. The proposed full-band recurrent network (FRN) model operates at 48 kHz, which is suitable for high-quality telecommunication applications. Experiment results highlight the superiority of FRN over an offline non-causal baseline and a top performer in a recent PLC challenge.
翻译:封藏包装物损失(PLC)是促使网络条件差或音频处理管道的流入/流出不足/流出导致语音退化的工具,我们建议采用实时经常性方法,利用先前的产出,在事先不知道丢失掩码的情况下,减少丢失包的精密性;拟议的全频频网络模式在48千赫兹运行,适合高质量的电信应用;实验结果突出表明FRN优于离线非因果基线,在最近的PLC挑战中,FRN优于顶级表现者。