控制将单独对话与非侵入性质量估计相结合 (Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate) - 专知论文

会员服务 ·

0

估计/估计量 · 分离的 · 任务对话系统 · 相关系数 · 控制器 ·

2021 年 7 月 21 日

Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate

翻译：控制将单独对话与非侵入性质量估计相结合

Matteo Torcoli,Jouni Paulus,Thorsten Kastner,Christian Uhle

from arxiv, Manuscript accepted for the 2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

Remixing separated audio sources trades off interferer attenuation against the amount of audible deteriorations. This paper proposes a non-intrusive audio quality estimation method for controlling this trade-off in a signal-adaptive manner. The recently proposed 2f-model is adopted as the underlying quality measure, since it has been shown to correlate strongly with basic audio quality in source separation. An alternative operation mode of the measure is proposed, more appropriate when considering material with long inactive periods of the target source. The 2f-model requires the reference target source as an input, but this is not available in many applications. Deep neural networks (DNNs) are trained to estimate the 2f-model intrusively using the reference target (iDNN2f), non-intrusively using the input mix as reference (nDNN2f), and reference-free using only the separated output signal (rDNN2f). It is shown that iDNN2f achieves very strong correlation with the original measure on the test data (Pearson r=0.99), while performance decreases for nDNN2f (r>=0.91) and rDNN2f (r>=0.82). The non-intrusive estimate nDNN2f is mapped to select item-dependent remixing gains with the aim of maximizing the interferer attenuation under a constraint on the minimum quality of the remixed output (e.g., audible but not annoying deteriorations). A listening test shows that this is successfully achieved even with very different selected gains (up to 23 dB difference).

翻译：调整分离的音频源时, 将干扰器的衰减与听觉变坏的数量进行交换。本文建议采用非侵入性音频质量估计方法, 以信号适应方式控制这种交换。最近提议的 2f 模型被作为基本质量衡量标准, 因为已经显示它与源分离的基本音频质量密切相关。在考虑目标源长期不活动期间的材料时, 提出该措施的替代操作模式更为合适。 2f 模型需要参考目标源作为输入, 但许多应用程序中都找不到这个源。深神经网络( DNNNN) 受过培训, 使用参考目标( iDNN2f) 来对2f 模型进行侵扰动性评估( iDNN2f) 模型, 仅使用分离输出信号( rDNN2fff) 。显示与测试数据( Pearson r=0. 99) 的原始测量非常强烈的关联性, 而对于 NNN2 和 IM 目标的最小性测试值则显示在 NN_r=x 的最小性测试值下, 。

0

相关内容

估计/估计量

估计/估计量

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【AAAI2021】元标签校正的噪声标签学习

专知会员服务

31+阅读 · 2020年12月7日

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

专知会员服务

22+阅读 · 2020年11月13日

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

专知会员服务

62+阅读 · 2020年1月11日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Facebook PyText 在 Github 上开源了

Facebook PyText 在 Github 上开源了

AINLP

7+阅读 · 2018年12月14日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Noisy-to-Noisy Voice Conversion Framework with Denoising Model

Arxiv

0+阅读 · 2021年9月22日

Privacy-preserving Credit Scoring via Functional Encryption

Arxiv

0+阅读 · 2021年9月22日

$A spatially adaptive phase-field model of fracture$

A spatially adaptive phase-field model of fracture

Arxiv

0+阅读 · 2021年9月21日

Physics-based Human Motion Estimation and Synthesis from Videos

Arxiv

0+阅读 · 2021年9月21日

Encrypted Data Processing

Arxiv

0+阅读 · 2021年9月20日

Bayesian Paired-Comparison with the bpcs Package

Arxiv

0+阅读 · 2021年9月20日

Making the Most of Parallel Composition in Differential Privacy

Arxiv

0+阅读 · 2021年9月19日

An Alignment-Agnostic Model for Chinese Text Error Correction

Arxiv

0+阅读 · 2021年9月18日

Remember the context! ASR slot error correction through memorization

Arxiv

0+阅读 · 2021年9月18日

Distributed Sequential Hypothesis Testing With Zero-Rate Compression

Arxiv

0+阅读 · 2021年9月17日

VIP会员

文章信息

相关主题

估计/估计量

任务对话系统

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【AAAI2021】元标签校正的噪声标签学习

专知会员服务

31+阅读 · 2020年12月7日

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

【NeurIPS 2020】图神经网络的参数化解释器，Parameterized Explainer for GNN

专知会员服务

22+阅读 · 2020年11月13日

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

专知会员服务

62+阅读 · 2020年1月11日

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

【Google可解释人工智能白皮书】27页pdf，AI Explainability Whitepaper ，Introduction to AI Explanations for AI Platform

专知会员服务

127+阅读 · 2019年12月13日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多模态数据存储与检索：综述

《算法战争研究计划全景评估》35页

【CMU博士论文】水下三维视觉感知与生成

智能体战争：自主人工智能军备竞赛全景透视

相关资讯

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Facebook PyText 在 Github 上开源了

Facebook PyText 在 Github 上开源了

AINLP

7+阅读 · 2018年12月14日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

条件GAN重大改进！cGANs with Projection Discriminator

条件GAN重大改进！cGANs with Projection Discriminator

CreateAMind

8+阅读 · 2018年2月7日

carla 学习笔记

carla 学习笔记

CreateAMind

9+阅读 · 2018年2月7日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Noisy-to-Noisy Voice Conversion Framework with Denoising Model

Arxiv

0+阅读 · 2021年9月22日

Privacy-preserving Credit Scoring via Functional Encryption

Arxiv

0+阅读 · 2021年9月22日

$A spatially adaptive phase-field model of fracture$

A spatially adaptive phase-field model of fracture

Arxiv

0+阅读 · 2021年9月21日

Physics-based Human Motion Estimation and Synthesis from Videos

Arxiv

0+阅读 · 2021年9月21日

Encrypted Data Processing

Arxiv

0+阅读 · 2021年9月20日

Bayesian Paired-Comparison with the bpcs Package

Arxiv

0+阅读 · 2021年9月20日

Making the Most of Parallel Composition in Differential Privacy

Arxiv

0+阅读 · 2021年9月19日

An Alignment-Agnostic Model for Chinese Text Error Correction

Arxiv

0+阅读 · 2021年9月18日

Remember the context! ASR slot error correction through memorization

Arxiv

0+阅读 · 2021年9月18日

Distributed Sequential Hypothesis Testing With Zero-Rate Compression

Arxiv

0+阅读 · 2021年9月17日

微信扫码咨询专知VIP会员