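Project title: Measurement and Analysis of the Perceptual Characteristics of Horizontal Sound Source Localization Cues in a Three-Dimensional Sound Field
Project number: No.61201340
Project type: Young Scientists Fund project
Year of approval: 2013
Discipline: Electronics and Information Systems
Principal investigator: Tu Weiping (涂卫平)
Affiliation: Wuhan University (武汉大学)
Funding: 250,000 CNY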
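Abstract (translated from Chinese): The rise of 3D film and television has made three-dimensional audio and video technology a technical frontier and research hotspot in the information and communication industry. Three-dimensional audio systems have many channels and large data volumes. Reducing the channel count and data volume by simplifying the system and compressing the data is an important problem that 3D audio systems urgently need to solve. Binaural cues are an important basis on which the human ear localizes the horizontal azimuth of a sound source in a 3D sound field, but the ear's perceptual sensitivity to binaural cues differs considerably across source azimuths. Targeting this perceptual sensitivity, the project measures the perceptual thresholds of binaural cues at multiple azimuths in the horizontal plane; builds, by mathematical interpolation, a binaural-cue perceptual threshold surface based on spatial distribution and frequency characteristics, extending traditional research on binaural-cue perceptual thresholds from qualitative description to quantitative analysis; and applies this surface model to the efficient coding of horizontal multichannel signals, proposing a perceptually lossless quantization method for multichannel spatial parameters based on perceptual thresholds. At the bitrate of a traditional mono coder, adding 8 kbps of spatial parameter information yields a high-quality reconstructed stereo signal. Compared with existing parametric stereo coders, reconstructed audio quality improves by 10 MUSHRA points. The results offer guidance for the efficient representation of 3D audio signals.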
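Keywords (translated from Chinese): sound source localization; binaural cues; just noticeable threshold; psychophysiology; spatial audio coding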
English abstract: The rapid development of 3D films has made 3D audio and video technology a frontier and hotspot of the information and communication industry. 3D audio systems have many channels and a huge amount of data. Reducing the channel count and data size of 3D audio systems by simplifying the system and compressing the data is a pressing task. Binaural cues are an important basis for the human auditory system to localize sound sources in the horizontal plane, but perceptual sensitivity to binaural cues differs significantly across source azimuths. Targeting the perceptual discrimination of binaural cues, this proposal measures the perceptual thresholds of binaural cues at multiple directions in the horizontal plane. Mathematical interpolation is used to build a threshold surface based on the spatial distribution and frequency characteristics of sound sources, extending traditional research on perceptual thresholds of binaural cues from qualitative description to quantitative analysis. The surface model is applied to the efficient coding of multichannel audio signals in the horizontal plane, and a quantization method for spatial parameters based on the perceptual thresholds of binaural cues is proposed. At a bitrate equal to that of traditional mono audio coders, excellent
English keywords: Source Localization; Binaural Cues; Just Noticeable Difference; Psychophysics; Spatial Audio Coding