The reconstruction of clipped speech signals is an important task in audio signal processing to achieve an enhanced audio quality for further processing. In this paper, Frequency Selective Extrapolation (FSE), which is commonly used for error concealment or the reconstruction of incomplete image data, is adapted to be able to restore audio signals which are distorted from clipping. For this, FSE generates a model of the signal as an iterative superposition of Fourier basis functions. Clipped samples can then be replaced by estimated samples from the model. The performance of the proposed algorithm is evaluated by using different speech test data sets. Compared to other state-of-the-art declipping algorithms, this leads to a maximum gain in SNR of up to 3:5 dB and an average gain of 1 dB.
翻译:剪贴语音信号的重建是音频信号处理的一项重要任务,目的是提高音频质量,供进一步处理。在本文中,通常用于隐藏错误或重建不完整图像数据的频率选择性外推法(FSE)经过调整,能够恢复从剪贴中扭曲的音频信号。为此,FSE生成了一个信号模型,作为Fourier基函数的迭接叠叠加。然后,剪贴样品可以由模型的估计样本取代。拟议算法的性能通过使用不同的语音测试数据集进行评估。与其他最先进的解析算法相比,这导致SNR的最大收益高达3:5 dB,平均收益为1 dB。