Monaural source separation: From anechoic to reverberant environments - Zhuanzhi Papers


May 10, 2022


Tobias Cord-Landwehr,Christoph Boeddeker,Thilo von Neumann,Catalin Zorila,Rama Doddipatla,Reinhold Haeb-Umbach

From arXiv; submitted to IWAENC 2022

Neural network-based single-channel speech source separation has made impressive progress in recent years. However, those improvements have mostly been reported on anechoic data, a condition that is rarely met in practice. Taking the SepFormer, which achieves state-of-the-art performance on anechoic mixtures, as a starting point, we gradually modify it to optimize its performance on reverberant mixtures. Although this improves the word error rate by 7 percentage points compared to the standard SepFormer implementation, the resulting system performs only marginally better than a PIT-BLSTM separation system that is optimized with rather straightforward means. This is surprising and at the same time sobering, and it challenges the practical usefulness of many improvements reported in recent years for monaural source separation on non-reverberant data.
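The abstract names a PIT-BLSTM baseline without describing its objective. PIT commonly stands for permutation invariant training, where the loss is evaluated for every assignment of network outputs to reference speakers and the lowest value is kept. The following is a minimal sketch of that idea, not the authors' implementation: the function names `mse` and `pit_loss` are hypothetical, and mean squared error is an assumed stand-in for whatever signal-level loss the paper actually uses.

```python
from itertools import permutations


def mse(estimate, reference):
    """Mean squared error between two equal-length signals (assumed pair loss)."""
    return sum((e - r) ** 2 for e, r in zip(estimate, reference)) / len(reference)


def pit_loss(estimates, references, pair_loss=mse):
    """Return (lowest mean loss, best permutation) over all speaker assignments.

    best_perm[i] is the index of the reference assigned to estimates[i].
    The search is exhaustive, so it is only practical for a few speakers.
    """
    n = len(estimates)
    best_loss, best_perm = float("inf"), None
    for perm in permutations(range(n)):
        loss = sum(pair_loss(estimates[i], references[perm[i]]) for i in range(n)) / n
        if loss < best_loss:
            best_loss, best_perm = loss, perm
    return best_loss, best_perm


# Toy example: the separator emits the two speakers in swapped order.
references = [[1.0, 0.0, -1.0, 0.0], [0.0, 2.0, 0.0, -2.0]]
estimates = [[0.0, 2.0, 0.0, -2.0], [1.0, 0.0, -1.0, 0.0]]
loss, perm = pit_loss(estimates, references)
print(loss, perm)  # 0.0 (1, 0)
```

In the toy example the identity assignment gives a mean loss of 2.5, while the swapped assignment gives 0.0, so the swapped order is selected and the output ordering does not penalize the separator.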

