DNSMOS P.835: 一种非侵入性概念性客观言语质量计量器,用于评价抑制噪音者 (DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors) - 专知论文

会员服务 ·

0

噪声 · 相关系数 · 得分 · 预测器/决策函数 · 优化器 ·

2022 年 2 月 4 日

DNSMOS P.835: A Non-Intrusive Perceptual Objective Speech Quality Metric to Evaluate Noise Suppressors

翻译：DNSMOS P.835: 一种非侵入性概念性客观言语质量计量器,用于评价抑制噪音者

Chandan K A Reddy,Vishak Gopal,Ross Cutler

from arxiv, arXiv admin note: substantial text overlap with arXiv:2010.15258

Human subjective evaluation is the gold standard to evaluate speech quality optimized for human perception. Perceptual objective metrics serve as a proxy for subjective scores. We have recently developed a non-intrusive speech quality metric called Deep Noise Suppression Mean Opinion Score (DNSMOS) using the scores from ITU-T Rec. P.808 subjective evaluation. The P.808 scores reflect the overall quality of the audio clip. ITU-T Rec. P.835 subjective evaluation framework gives the standalone quality scores of speech and background noise in addition to the overall quality. In this work, we train an objective metric based on P.835 human ratings that outputs 3 scores: i) speech quality (SIG), ii) background noise quality (BAK), and iii) the overall quality (OVRL) of the audio. The developed metric is highly correlated with human ratings, with a Pearson's Correlation Coefficient (PCC)=0.94 for SIG and PCC=0.98 for BAK and OVRL. This is the first non-intrusive P.835 predictor we are aware of. DNSMOS P.835 is made publicly available as an Azure service.

翻译：人类主观评价是评价为人类感知而优化的言语质量的黄金标准。概念客观指标是主观分数的替代物。我们最近利用ITU-T Rec. P.808主观评价的分数,开发了非侵入性言语质量指标,称为深噪音抑制平均意见评分(DNSMOS)。P.808分反映了音频剪辑的整体质量。ITU-T Rec. P.835主观评价框架除了总体质量外,还给出了单独的言语质量评分和背景噪音评分。在这项工作中,我们根据P.835人类评分培训了一种客观指标,结果3分为:i)言语质量(SIG)、ii)背景噪音质量(BAK)和iii),声音总体质量(OVRL)。开发的评分与人类评分高度相关,Pearson的调率(PCC)为0.94分,而BAK和VVRL为P.98分的PCC=0.98。这是我们所知道的ANSS P.835公开提供的Aser服务。

0

相关内容

【CVPR2022】机器人物体重排的迭代流最小化，IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

【CVPR2022】机器人物体重排的迭代流最小化，IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

专知会员服务

5+阅读 · 2022年3月2日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

专知会员服务

32+阅读 · 2019年12月26日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

14+阅读 · 2019年11月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

Berezin变换及相关的算子理论

国家自然科学基金

1+阅读 · 2014年12月31日

CT图像重建中的分裂可行问题及其扩展形式的优化算法研究与实现

国家自然科学基金

0+阅读 · 2013年12月31日

极化层析SAR人造目标三维重构与特征提取研究

国家自然科学基金

1+阅读 · 2013年12月31日

LAMOST光谱质量控制和检查系统的软件实现和关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

GPU加速的视频抽象化和卡通化

国家自然科学基金

0+阅读 · 2009年12月31日

Robotic Inspection of Underground Utilities for Construction Survey Using a Ground Penetrating Radar

Arxiv

0+阅读 · 2022年4月19日

Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Arxiv

0+阅读 · 2022年4月17日

Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Arxiv

0+阅读 · 2022年4月15日

Interactive Object Segmentation in 3D Point Clouds

Arxiv

0+阅读 · 2022年4月14日

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Arxiv

11+阅读 · 2019年11月4日

VIP会员

文章信息

相关主题

预测器/决策函数

相关VIP内容

【CVPR2022】机器人物体重排的迭代流最小化，IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

【CVPR2022】机器人物体重排的迭代流最小化，IFOR: Iterative Flow Minimization for Robotic Object Rearrangement

专知会员服务

5+阅读 · 2022年3月2日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

【WSDM 2020】RecVAE:一种新的变分自编码器，用于具有隐式反馈的Top-N推荐（RecVAE: a New Variational Autoencoder for Top-NRecommendations with Implicit Feedback）

专知会员服务

32+阅读 · 2019年12月26日

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

【AAAI2020接受论文】预测性参与:开放领域对话系统自动评估的有效指标（Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems）

专知会员服务

14+阅读 · 2019年11月15日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

前沿人工智能趋势报告（Frontier AI Trends Report）

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

音退化问题：基于输入操控的鲁棒语音转换综述

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】RNN/LSTM时序预测

【推荐】RNN/LSTM时序预测

机器学习研究会

25+阅读 · 2017年9月8日

相关论文

Robotic Inspection of Underground Utilities for Construction Survey Using a Ground Penetrating Radar

Arxiv

0+阅读 · 2022年4月19日

Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons

Arxiv

0+阅读 · 2022年4月17日

Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications

Arxiv

0+阅读 · 2022年4月15日

Interactive Object Segmentation in 3D Point Clouds

Arxiv

0+阅读 · 2022年4月14日

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Predictive Engagement: An Efficient Metric For Automatic Evaluation of Open-Domain Dialogue Systems

Arxiv

11+阅读 · 2019年11月4日

相关基金

Berezin变换及相关的算子理论

国家自然科学基金

1+阅读 · 2014年12月31日

CT图像重建中的分裂可行问题及其扩展形式的优化算法研究与实现

国家自然科学基金

0+阅读 · 2013年12月31日

极化层析SAR人造目标三维重构与特征提取研究

国家自然科学基金

1+阅读 · 2013年12月31日

LAMOST光谱质量控制和检查系统的软件实现和关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

GPU加速的视频抽象化和卡通化

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员