In this paper, we present a causal speech signal improvement system that is designed to handle different types of distortions. The method is based on a generative diffusion model which has been shown to work well in scenarios with missing data and non-linear corruptions. To guarantee causal processing, we modify the network architecture of our previous work and replace global normalization with causal adaptive gain control. We generate diverse training data containing a broad range of distortions. This work was performed in the context of an "ICASSP Signal Processing Grand Challenge" and submitted to the non-real-time track of the "Speech Signal Improvement Challenge 2023", where it was ranked fifth.
翻译:在本文中,我们提出了一个因果言语信号改进系统,旨在处理不同类型的扭曲现象,该方法基于一种基因化传播模式,在缺少数据和非线性腐败的情况下,该方法已证明运作良好。为了保证因果处理,我们修改了我们以前工作的网络结构,用因果适应收益控制取代全球正常化。我们生成了包含广泛扭曲现象的各种培训数据。这项工作是在“ICASSP信号处理重大挑战”的背景下进行的,并提交给了“2023年语音信号改进挑战”的非实时跟踪,而“2023年语音信号改进挑战”排在第五位。</s>