There is a considerable interest in developing algorithms that can predict audio quality of perceptually coded signals to avoid the cost of extensive listening tests during development time. While many established algorithms for predicting the perceived quality of signals with monaural (timbral) distortions are available (PEAQ, POLQA), predicting the quality degradation of stereo and multi-channel spatial signals is still considered a challenge. Audio quality degradation arising from spatial distortions is usually measured in terms of well known inter-aural cue distortion measures such as Inter-aural Level Difference Distortions (ILDD), Inter-aural Time Difference Distortions (ITDD) and Inter-aural Cross Correlation Distortions (IACCD). However, the extent to which their interaction influences the overall audio quality degradation in complex signals as expressed - for example - in a multiple stimuli test is not yet thoroughly studied. We propose a systematic approach that introduces controlled combinations of spatial distortions on a representative set of signals and evaluates their influence on overall perceived quality degradation by analyzing listening test scores over said signals. From this study we derive guidelines for designing meaningful distortion measures that consider inter-aural cue distortion interactions.
翻译:发展算法,可以预测感知编码信号的音质质量,以避免在开发期间进行广泛的听觉测试的成本; 虽然有许多既定算法可以预测带有寺庙(临时)扭曲的信号的感知质量(PEAQ, POLQA),但预测立体和多通道空间信号的质量退化仍被视为一项挑战; 空间扭曲造成的音质退化通常以众所周知的跨界信号扭曲措施来衡量,如跨级水平差异扭曲(ILDD)、跨时差异扭曲(ITDD)和跨度交叉关联扭曲(IACCD)等; 然而,对于这些信号的相互作用在多大程度上影响复杂信号的整体音质退化,例如,在多次刺激测试中表达的音质和多通道空间信号,还没有进行透彻的研究; 我们建议一种系统的方法,将空间扭曲的受控组合引入具有代表性的信号集,并通过分析对所述信号的监听测试分来评估其对总体觉察质量退化的影响。 我们从这项研究中得出设计有意义的扭曲措施的指导方针,考虑跨度扭曲措施。