音频相似性作为音频质量的代理工具是不可靠的 (Audio Similarity is Unreliable as a Proxy for Audio Quality) - 专知论文

会员服务 ·

0

相似度 · 相关系数 · 分解的 · 全 · Better ·

2022 年 6 月 27 日

Audio Similarity is Unreliable as a Proxy for Audio Quality

翻译：音频相似性作为音频质量的代理工具是不可靠的

Pranay Manocha,Zeyu Jin,Adam Finkelstein

from arxiv, To Appear, Interspeech 2022

Many audio processing tasks require perceptual assessment. However, the time and expense of obtaining ``gold standard'' human judgments limit the availability of such data. Most applications incorporate full reference or other similarity-based metrics (e.g. PESQ) that depend on a clean reference. Researchers have relied on such metrics to evaluate and compare various proposed methods, often concluding that small, measured differences imply one is more effective than another. This paper demonstrates several practical scenarios where similarity metrics fail to agree with human perception, because they: (1) vary with clean references; (2) rely on attributes that humans factor out when considering quality, and (3) are sensitive to imperceptible signal level differences. In those scenarios, we show that no-reference metrics do not suffer from such shortcomings and correlate better with human perception. We conclude therefore that similarity serves as an unreliable proxy for audio quality.

翻译：许多音频处理任务需要感知评估。然而,获得“黄金标准”人类判断的时间和费用限制了这些数据的可用性。大多数应用都包含完全参考或依赖清洁参考的基于相似度的衡量标准(例如PESQ),研究人员依靠这些衡量标准来评价和比较各种拟议方法,往往认为小的、计量的差异意味着一种方法比另一种方法更有效。本文件展示了一些实际情景,其中相似度指标不能与人的看法一致,因为它们:(1) 与清洁参考标准不同;(2) 在考虑质量时依赖人类因素的属性;(3) 敏感地注意无法察觉的信号水平差异。在这些情形中,我们表明,不参考度指标没有受到这种缺陷的影响,而且与人的看法更相关。因此,我们的结论是,类似性作为声音质量的不可靠的替代物。

0

相关内容

相似度

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

CaR在TMJ骨关节炎关节软骨细胞异常增殖与分化中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

系统性红斑狼疮血脑屏障损伤的动态增强磁共振检测及分子免疫机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

炎症调节因子c-REL抑制TAp73的抗骨肉瘤作用的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

amiRNA干扰NMHC II-A对PRRSV感染细胞凋亡信号传导的影响及机制

国家自然科学基金

0+阅读 · 2012年12月31日

聚合物薄膜场效应晶体管材料的设计、合成与器件化

国家自然科学基金

0+阅读 · 2012年12月31日

压控下BMMSCs/PRF双层复合体治疗塌陷后股骨头坏死软骨损伤的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

靶向GRPR的铁蛋白笼状嵌合体多功能分子探针的构建及其PET/MRI/NIRF三模式显像的探索研究

国家自然科学基金

0+阅读 · 2011年12月31日

B细胞刺激因子受体BAFF-R、BCMA和TACI介导信号和相互关系在胶原性关节炎发病中作用及受体抑制剂对其的影响

国家自然科学基金

0+阅读 · 2011年12月31日

HAT/HDAC失衡与乙酰化修饰异常：急性肺损伤炎症失控新机制

国家自然科学基金

0+阅读 · 2009年12月31日

温针干预兔软骨退变的信号转导机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

Arxiv

0+阅读 · 2022年8月17日

Knowledge Graph Curation: A Practical Framework

Arxiv

0+阅读 · 2022年8月17日

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Arxiv

0+阅读 · 2022年8月17日

SupMAE: Supervised Masked Autoencoders Are Efficient Vision Learners

Arxiv

0+阅读 · 2022年8月16日

The Weighting Game: Evaluating Quality of Explainability Methods

Arxiv

0+阅读 · 2022年8月12日

AV-Gaze: A Study on the Effectiveness of Audio Guided Visual Attention Estimation for Non-Profilic Faces

Arxiv

0+阅读 · 2022年8月12日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Which Knowledge Graph Is Best for Me?

Arxiv

11+阅读 · 2018年9月28日

Approaches for Enriching and Improving Textual Knowledge Bases

Arxiv

15+阅读 · 2018年4月20日

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Arxiv

11+阅读 · 2018年1月11日

VIP会员

文章信息

相关主题

相关VIP内容

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

开源书：PyTorch深度学习起步

开源书：PyTorch深度学习起步

专知会员服务

51+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Deep Research（深度研究）：系统性综述

《革新战术战场空间能力：反无人机系统》报告

【普林斯顿博士论文】用于语音的生成式通用模型

螺旋式开发作为战略资产：美军启示

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

【推荐】ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

机器学习研究会

20+阅读 · 2017年12月17日

相关论文

Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

Arxiv

0+阅读 · 2022年8月17日

Knowledge Graph Curation: A Practical Framework

Arxiv

0+阅读 · 2022年8月17日

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Arxiv

0+阅读 · 2022年8月17日

SupMAE: Supervised Masked Autoencoders Are Efficient Vision Learners

Arxiv

0+阅读 · 2022年8月16日

The Weighting Game: Evaluating Quality of Explainability Methods

Arxiv

0+阅读 · 2022年8月12日

AV-Gaze: A Study on the Effectiveness of Audio Guided Visual Attention Estimation for Non-Profilic Faces

Arxiv

0+阅读 · 2022年8月12日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Which Knowledge Graph Is Best for Me?

Arxiv

11+阅读 · 2018年9月28日

Approaches for Enriching and Improving Textual Knowledge Bases

Arxiv

15+阅读 · 2018年4月20日

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Arxiv

11+阅读 · 2018年1月11日

相关基金

CaR在TMJ骨关节炎关节软骨细胞异常增殖与分化中的作用研究

国家自然科学基金

0+阅读 · 2014年12月31日

系统性红斑狼疮血脑屏障损伤的动态增强磁共振检测及分子免疫机制的研究

国家自然科学基金

0+阅读 · 2014年12月31日

炎症调节因子c-REL抑制TAp73的抗骨肉瘤作用的分子机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

amiRNA干扰NMHC II-A对PRRSV感染细胞凋亡信号传导的影响及机制

国家自然科学基金

0+阅读 · 2012年12月31日

聚合物薄膜场效应晶体管材料的设计、合成与器件化

国家自然科学基金

0+阅读 · 2012年12月31日

压控下BMMSCs/PRF双层复合体治疗塌陷后股骨头坏死软骨损伤的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

靶向GRPR的铁蛋白笼状嵌合体多功能分子探针的构建及其PET/MRI/NIRF三模式显像的探索研究

国家自然科学基金

0+阅读 · 2011年12月31日

B细胞刺激因子受体BAFF-R、BCMA和TACI介导信号和相互关系在胶原性关节炎发病中作用及受体抑制剂对其的影响

国家自然科学基金

0+阅读 · 2011年12月31日

HAT/HDAC失衡与乙酰化修饰异常：急性肺损伤炎症失控新机制

国家自然科学基金

0+阅读 · 2009年12月31日

温针干预兔软骨退变的信号转导机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员