项目名称: 三维音频中空间方位信息感知编码关键技术研究
项目编号: No.61201169
项目类型: 青年科学基金项目
立项/批准年度: 2013
项目学科: 电子学与信息系统
项目作者: 王晓晨
作者单位: 武汉大学
项目金额: 25万元
中文摘要: 基于多音箱回放的3D音频技术快速发展,MPEG开始制订3D音频标准,3D音频技术已成为新的热点。相较于传统环绕声,3D音频的关键就是其对三维空间方位感的重现,因此空间信息编码是3D音频编码系统的核心。研究显示人耳对不同方位、频率的空间信息感知阈值相差可达40倍,因此基于感知的3D音频空间信息编码成为3D音频高性能编码的关键。本项目针对现有多声道编码技术缺少对声音空间信息感知特性的考虑,在追求压缩率时空间信息感知失真过大的问题,在已有空间听觉实验的基础上,将传统感知熵理论拓展到空间可感知信息量的计算,建立可感知空间信息度量模型,给出基于感知的空间信息失真测度,完成基于感知的空间信息量化器设计,研究感知失真条件下空间信息比特分配算法,最终构建基于感知的空间信息编码框架,预期可进一步改善现有3D音频编码器主观性能,研究成果渴望成为相关标准的支撑技术,为解决当前3D音频编码的性能瓶颈提供技术支撑。
中文关键词: 三维音频编码;感知失真测度;空间参数;;
英文摘要: As the rapid development of 3D audio technology based on the Multi-speaker playback, MPEG started to develop the 3D Audio standard. 3D audio technology has been a new hot spot. Compared with the traditional surrounding audio, the Key point of the 3D audio encoding system is the Reproducibility of 3D direction sense. As a result, encoding the spatial information is the core part of the 3D audio encoding system. Researchers are shown that the spatial information perception threshold for different directions and frequencies, the spatial information perception threshold differs up to 40 times to the human ear, so that 3D audio spatial information encoding based on the perception is the key point for 3D audio high-performance encoding. This project aim to the less consideration for the perceptual characteristics of the audio space information among the current multi-channel coding techniques, try to solve the problem that substantial distortion of perception of spatial information during compressing. Based on the existing spatial auditory experiment, we expand the traditional entropy theory to the calculation of the amount of space perception information and then build the perceived spatial information measure model to bring out the distortion measure of the perception spatial information. Moreover, we finish the des
英文关键词: Three-dimensional audio coding;perceptual distortion measure;spatial parameters;;;