保留话者特征的中远距离水声语音通信关键技术研究

项目名称： 保留话者特征的中远距离水声语音通信关键技术研究

项目编号： No.61302109

项目类型： 青年科学基金项目

立项/批准年度： 2014

项目学科： 无线电电子学、电信技术

项目作者： 肖东

作者单位： 中国科学院声学研究所

项目金额： 28万元

中文摘要： 水声语音通信中，水声信道中各种不利因素限制了语音编码速率，进而制约了话音质量。目前常见的低码速率语音编码算法通常只满足理解语义的要求，难以保留说话人特征。而可以保留说话人特征的语音编码算法的编码速率一般在1kbps量级以上，对于中远距离(>10km)水声语音通信来说偏高。本项目考虑到水声传播的时延较大的特点，为缓存较长时间的语音信号进行充分处理提供了条件。本项目计划以参数语音编码算法为参考，根据听觉感知特性细化语音信号类型；将有限的编码比特数尽可能多的分配给对听觉感知贡献的大和携带说话人特征的多的语音信号类型；考察采用压缩感知的方法对语音参数矩阵进行量化的可行性；寻找该变速率语音编码算法与现有水声通信系统的有效结合点，尽可能的提高带宽利用率。最终实现一个保留说话人特征的适用于中远距离的水声语音通信系统。为高话音质量的水声语音通信提供理论依据和技术基础。

中文关键词： 水声语音通信；低速率语音编码；保留话者特征；过渡音征；人耳听觉感知

英文摘要： In underwater acoustic speech communication, the bit rate of vocoder is affected by many disadvantages in underwater acoustic channel, such that the speech quality is also restricted. Usually, the low bit rate vocoders just meet the need of semantic comprehension. The individual characteristics of speakers can not be kept. In general, the bit rate of usual vocoders with retention of speakers' characteristics exceeds 1kbps, which is relatively higher for medium or long distance (>10km) underwater acoustic communication. This project considers that the relatively long time delay of underwater acoustic propagation provides conditions for buffering longer speech signal to sufficient processing. This project plans to refer to parametric speech coding algorithms and refine types of speech signal according to auditory perception features. The limited coding bits will be assigned to the types of speech signal, which contributes more for auditory perception and carries more individual characteristics of speakers, as many as possible. The feasibility of using compressed sensing method to quantize the speech parameter matrix will be investigated. An effective combination point of variable bit rate vocoder and the existed underwater acoustic communication system will be searched for, in order to increase the bandwidth utili

英文关键词： Underwater Acoustic Speech Communication；Low Bit Rate Speech Codec；Retention of Speakers' Characteristics；Transition；Human Auditory Perception

成为VIP会员查看完整内容