Visualising and Explaining Deep Learning Models for Speech Quality Prediction

Estimating the quality of transmitted speech is known to be a non-trivial task. Traditionally, test participants are asked to rate the quality of samples; nowadays, automated methods are available. These methods can be divided into 1) intrusive models, which use both the original and the degraded signals, and 2) non-intrusive models, which require only the degraded signal. Recently, non-intrusive models based on neural networks have been shown to outperform models based on signal processing. However, the advantages of deep-learning-based models come at the cost of being more challenging to interpret. To gain more insight into the prediction models, the non-intrusive speech quality prediction model NISQA is analyzed in this paper. NISQA is composed of a convolutional neural network (CNN) and a recurrent neural network (RNN). The task of the CNN is to compute features relevant to speech quality prediction at the frame level, while the RNN models time dependencies between the individual speech frames. Different explanation algorithms are used to understand the automatically learned features of the CNN. In this way, several interpretable features could be identified, such as sensitivity to noise or to strong interruptions. On the other hand, it was found that multiple features carry redundant information.
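The two-stage data flow described in the abstract (frame-level features, then a recurrent pass over the frames) can be illustrated with a deliberately tiny, pure-Python sketch. This is not the NISQA architecture: the convolution kernels, the weights, the scalar recurrent state, and all function names below are hypothetical and hand-picked for illustration, whereas a real model would learn its parameters from rated speech samples.

```python
import math

def conv1d_relu_maxpool(frame, kernel):
    """Toy stand-in for a CNN: valid 1-D convolution, ReLU, global max pooling."""
    k = len(kernel)
    responses = [
        sum(frame[i + j] * kernel[j] for j in range(k))
        for i in range(len(frame) - k + 1)
    ]
    return max(max(r, 0.0) for r in responses)

def frame_features(frame, kernels):
    """Compute one feature per kernel for a single speech frame."""
    return [conv1d_relu_maxpool(frame, kern) for kern in kernels]

def recurrent_pool(feature_seq, w_in, w_rec):
    """Toy stand-in for an RNN: a scalar state updated frame by frame."""
    h = 0.0
    for feats in feature_seq:
        x = sum(w * f for w, f in zip(w_in, feats))
        h = math.tanh(x + w_rec * h)  # the new state depends on the previous one
    return h

def predict_quality(frames, kernels, w_in, w_rec):
    """Frame-level features first, then aggregation over time."""
    feats = [frame_features(f, kernels) for f in frames]
    return recurrent_pool(feats, w_in, w_rec)

# Hypothetical, hand-picked parameters (assumptions, not learned values).
KERNELS = [[1.0, -1.0],   # responds to abrupt changes within a frame
           [0.5, 0.5]]    # responds to overall level
W_IN = [-0.8, 0.3]        # abrupt changes lower the score in this toy setup
W_REC = 0.5

clean = [[0.5, 0.5, 0.5, 0.5]] * 3   # three smooth frames
noisy = [[0.5, 0.9, 0.1, 0.7]] * 3   # three strongly fluctuating frames
score_clean = predict_quality(clean, KERNELS, W_IN, W_REC)
score_noisy = predict_quality(noisy, KERNELS, W_IN, W_REC)
```

In this toy setup the fluctuating frames receive a lower score than the smooth ones because the first kernel fires on abrupt changes and carries a negative input weight. Inspecting which input patterns make a single feature fire is, in a very reduced form, the kind of question the explanation algorithms mentioned in the abstract are meant to answer for learned CNN features.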


