SA-SRD:会议风格数据分离的新损失功能 (SA-SDR: A novel loss function for separation of meeting style data) - 专知论文

会员服务 ·

0

分离的 · 损失函数（机器学习） · 泛函 · 稳健性 · Processing（编程语言） ·

2021 年 10 月 29 日

SA-SDR: A novel loss function for separation of meeting style data

翻译：SA-SRD:会议风格数据分离的新损失功能

Thilo von Neumann,Keisuke Kinoshita,Christoph Boeddeker,Marc Delcroix,Reinhold Haeb-Umbach

from arxiv, submitted to ICASSP 2022

Many state-of-the-art neural network-based source separation systems use the averaged Signal-to-Distortion Ratio (SDR) as a training objective function. The basic SDR is, however, undefined if the network reconstructs the reference signal perfectly or if the reference signal contains silence, e.g., when a two-output separator processes a single-speaker recording. Many modifications to the plain SDR have been proposed that trade-off between making the loss more robust and distorting its value. We propose to switch from a mean over the SDRs of each individual output channel to a global SDR over all output channels at the same time, which we call source-aggregated SDR (SA-SDR). This makes the loss robust against silence and perfect reconstruction as long as at least one reference signal is not silent. We experimentally show that our proposed SA-SDR is more stable and preferable over other well-known modifications when processing meeting-style data that typically contains many silent or single-speaker regions.

翻译：许多最先进的神经网络源分离系统使用平均信号对扭曲比率(SDR)作为培训目标功能。但是,如果网络完全重建参考信号,或者参考信号含有沉默,例如,当一个双输出分隔器处理单声波记录时,基本特别提款权是没有定义的。对普通特别提款权的许多修改建议是,在使损失更加稳健和扭曲其价值之间作出权衡。我们提议从每个单个输出渠道的比重转换为全球特别提款权,同时将所有产出渠道的比重转换为全球特别提款权,我们称之为源隔离特别提款权(SA-SDR)。只要至少有一个参考信号没有沉默,就使得失去沉默和完全重建成为强势。我们实验性地表明,在处理通常包含许多静音或单声调区域的会议模式数据时,我们提议的南南特别提款权比其他众所周知的修改更稳定、更可取。

0

相关内容

分离的

【ICML2021】 One-shot 权重共享神经网络结构搜索算法

专知会员服务

18+阅读 · 2021年8月4日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【CVPR2021】胶囊网络并不比卷积网络更鲁棒

专知会员服务

21+阅读 · 2021年4月1日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

简明扼要！Python教程手册，206页pdf

简明扼要！Python教程手册，206页pdf

专知会员服务

48+阅读 · 2020年3月24日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【资源】语音增强资源集锦

【资源】语音增强资源集锦

专知

8+阅读 · 2020年7月4日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Bayesian Optimization of Function Networks

Arxiv

0+阅读 · 2021年12月31日

A simple method for estimating the Lorenz curve

Arxiv

0+阅读 · 2021年12月31日

CertainNet: Sampling-free Uncertainty Estimation for Object Detection

Arxiv

0+阅读 · 2021年12月28日

Constructions of Binary Cross Z-Complementary Pairs With Large CZC Ratio

Arxiv

0+阅读 · 2021年12月28日

Learning Frequency Domain Approximation for Binary Neural Networks

Arxiv

3+阅读 · 2021年11月22日

Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

Arxiv

7+阅读 · 2021年8月17日

Asymmetric Loss For Multi-Label Classification

Arxiv

6+阅读 · 2020年9月29日

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss

Arxiv

3+阅读 · 2020年2月2日

Multi-Source Neural Machine Translation with Missing Data

Arxiv

5+阅读 · 2018年6月7日

Camera Style Adaptation for Person Re-identification

Arxiv

3+阅读 · 2018年4月10日

VIP会员

文章信息

相关主题

损失函数（机器学习）

Processing（编程语言）

相关VIP内容

【ICML2021】 One-shot 权重共享神经网络结构搜索算法

专知会员服务

18+阅读 · 2021年8月4日

【ICML2021】异质风险最小化，Heterogeneous Risk Minimization

专知会员服务

16+阅读 · 2021年5月21日

【CVPR2021】胶囊网络并不比卷积网络更鲁棒

专知会员服务

21+阅读 · 2021年4月1日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

简明扼要！Python教程手册，206页pdf

简明扼要！Python教程手册，206页pdf

专知会员服务

48+阅读 · 2020年3月24日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

95+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

【资源】语音增强资源集锦

【资源】语音增强资源集锦

专知

8+阅读 · 2020年7月4日

鲁棒机器学习相关文献集

鲁棒机器学习相关文献集

专知

8+阅读 · 2019年8月18日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

meta learning 17年：MAML SNAIL

meta learning 17年：MAML SNAIL

CreateAMind

11+阅读 · 2019年1月2日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Bayesian Optimization of Function Networks

Arxiv

0+阅读 · 2021年12月31日

A simple method for estimating the Lorenz curve

Arxiv

0+阅读 · 2021年12月31日

CertainNet: Sampling-free Uncertainty Estimation for Object Detection

Arxiv

0+阅读 · 2021年12月28日

Constructions of Binary Cross Z-Complementary Pairs With Large CZC Ratio

Arxiv

0+阅读 · 2021年12月28日

Learning Frequency Domain Approximation for Binary Neural Networks

Arxiv

3+阅读 · 2021年11月22日

Self-supervised Monocular Depth Estimation for All Day Images using Domain Separation

Arxiv

7+阅读 · 2021年8月17日

Asymmetric Loss For Multi-Label Classification

Arxiv

6+阅读 · 2020年9月29日

WaveTTS: Tacotron-based TTS with Joint Time-Frequency Domain Loss

Arxiv

3+阅读 · 2020年2月2日

Multi-Source Neural Machine Translation with Missing Data

Arxiv

5+阅读 · 2018年6月7日

Camera Style Adaptation for Person Re-identification

Arxiv

3+阅读 · 2018年4月10日

微信扫码咨询专知VIP会员