Digital audio signal reconstruction of a lost or corrupt segment using deep learning algorithms has been explored intensively in recent years. Nevertheless, prior traditional methods with linear interpolation, phase coding and tone insertion techniques are still in vogue. However, we found no research work on reconstructing audio signals with the fusion of dithering, steganography, and machine learning regressors. Therefore, this paper proposes the combination of steganography, halftoning (dithering), and state-of-the-art shallow (RF- Random Forest regression) and deep learning (LSTM- Long Short-Term Memory) methods. The results (including comparing the SPAIN, Autoregressive, deep learning-based, graph-based, and other methods) are evaluated with three different metrics. The observations from the results show that the proposed solution is effective and can enhance the reconstruction of audio signals performed by the side information (e.g., Latent representation and learning for audio inpainting) steganography provides. Moreover, this paper proposes a novel framework for reconstruction from heavily compressed embedded audio data using halftoning (i.e., dithering) and machine learning, which we termed the HCR (halftone-based compression and reconstruction). This work may trigger interest in optimising this approach and/or transferring it to different domains (i.e., image reconstruction). Compared to existing methods, we show improvement in the inpainting performance in terms of signal-to-noise (SNR), the objective difference grade (ODG) and the Hansen's audio quality metric.
翻译:近年来,利用深层学习算法对失传或腐败部分的数字音频信号重建进行了深入探讨,然而,以往使用线性内插、阶段编码和音调插入技术的传统方法仍然在流行之中。然而,我们发现,在重建音频信号方面,没有开展过任何研究工作,将变速、变色和机器学习回归器混为一体。因此,本文件建议结合采用线性、半发式(抖动)和最先进的浅度(RF随机森林回归)和深度(LSTM-长期短期内存)方法。结果(包括比较SPAIN、自动递进式、深层学习、图制和其他方法),用三种不同的尺度进行评估。结果显示,拟议解决办法是有效的,可以加强侧面信息(例如,音频缩图的表示和学习)的音频信号的重建(RBRM)和深层次(我们用半收缩的音频数据重建(i. deplimal-read-read-degrading the minding the rual-deal-mographal-deal-deal-deal-deal-deal-mographal-deal-deal-de-de-de) restraction-s main-serview) 和我们用半调制成为当前、再造价和制成为历史和制的系统,可以显示和制平平平调制成。