项目名称: 超宽带嵌入式变比特率音频编码算法研究
项目编号: No.60872027
项目类型: 面上项目
立项/批准年度: 2009
项目学科: 轻工业、手工业
项目作者: 鲍长春
作者单位: 北京工业大学
项目金额: 30万元
中文摘要: 本课题针对嵌入式语音与音频编码问题展开研究,结合各种信息处理方法和数学模型,提出了多种应用于嵌入式编码的新算法,包括:基于G.722.1的语音与音频嵌入式编码方法、基于位平面编码和SQVH技术的嵌入式编码方法、基于心理声学模型的音频子带重要性排序方法、指数阻尼正弦模型阶选择算法、基于经验模态分解的正弦音频信号分析合成方法、基于多帧分形预测的帧擦除掩蔽技术、最近邻匹配的非线性频带扩展方法、基于软判决矢量量化的语音频带扩展、基于混沌预测的音频信号高频重建方法、基于FFT的参数立体声编码方法、基于主成分分析的嵌入式立体声编码方法、基于高斯混合模型的谐波信号频带扩展方法等,并综合上述方法和技术,开发出了一套码率在8~64 kb/s 的超宽带嵌入式语音与音频编码算法,该算法经ITU-T认证的北京理工大学通信技术研究所进行标准化评测,测试结果表明,本课题提出的编码器在各码率下编码质量均达到了ITU-T提出的参考指标要求。
中文关键词: 语音编码;音频编码;嵌入式编码;帧擦除掩蔽;频带扩展
英文摘要: The research work is focused on the embedded speech and audio coding. By applying signal processing methods and mathematical models, some novel algorithms are proposed for the embedded coding, including :embedded speech and audio codec based on ITU-T recommendation G.722.1, embedded speech and audio coding method based on bit-plane coding and SQVH, the sub-band importance ordering method based on psychoacoustic model, order-selection algorithm for exponentially damped sinusoids model, sinusoidal audio analysis and synthesis model based on EMD, frame erasure concealment method based on multi-frame fractal prediction, nonlinear bandwidth extension based on nearest-neighbor matching, bandwidth extension of speech based on soft-decision vector quantization, high frequency reconstruction of audio signal based on chaotic prediction theory, parametric stereo coding method based on FFT,embedded stereo speech and audio coding method based on principal component analysis, harmonic bandwidth extension based on Gaussian Mixture Model and so on. An embedded speech and audio codec was developed by integrating aforementioned methods and techniques. This codec has been tested by Institute of Communication Technology, Beijing Institute of Technology Research which has been identified by the ITU-T. The test results show that the proposed codec could meet the requirements of the Terms of Reference (ToR) which was requested by ITU-T.
英文关键词: speech coding; audio coding; embedded coding; frame erasure concealment; bandwidth extension