Without the need of a clean reference, non-intrusive speech assessment methods have caught great attention for objective evaluations. Recently, deep neural network (DNN) models have been applied to build non-intrusive speech assessment approaches and confirmed to provide promising performance. However, most DNN-based approaches are designed for normal-hearing listeners without considering hearing-loss factors. In this study, we propose a DNN-based hearing aid speech assessment network (HASA-Net), formed by a bidirectional long short-term memory (BLSTM) model, to predict speech quality and intelligibility scores simultaneously according to input speech signals and specified hearing-loss patterns. To the best of our knowledge, HASA-Net is the first work to incorporate quality and intelligibility assessments utilizing a unified DNN-based non-intrusive model for hearing aids. Experimental results show that the predicted speech quality and intelligibility scores of HASA-Net are highly correlated to two well-known intrusive hearing-aid evaluation metrics, hearing aid speech quality index (HASQI) and hearing aid speech perception index (HASPI), respectively.
翻译:最近,深神经网络(DNN)模型被用于建立非侵入性语言评估方法,并被证实可提供有希望的业绩;然而,大多数基于DNN的方法是为正常听力的听众设计的,而不考虑听力损失因素;在这项研究中,我们提议一个基于DNN的助听语评估网络(HASA-Net),由双向长期短期内存(BLSTM)模型组成,根据输入性语言信号和特定听力损失模式,同时预测语音质量和智能分数;就我们所知,HASA-Net是利用统一的DNN非侵入性助听器模型纳入质量和智能评估的第一个工作;实验结果显示,HASA-Net的预测语音质量和智能分数与两个众所周知的侵入性听力援助评价指标、听力语音质量指数(HASQI)和助听力语音感知觉指数(HASPI)高度相关。