引导语音增强网络</s> (Guided Speech Enhancement Network) - 专知论文

会员服务 ·

0

语音增强 · Networking · MoDELS · 相互独立的 · 输出 ·

2023 年 3 月 13 日

Guided Speech Enhancement Network

翻译：引导语音增强网络

Yang Yang,Shao-Fu Shih,Hakan Erdogan,Jamie Menjay Lin,Chehung Lee,Yunpeng Li,George Sung,Matthias Grundmann

from arxiv, Accepted to ICASSP 2023

High quality speech capture has been widely studied for both voice communication and human computer interface reasons. To improve the capture performance, we can often find multi-microphone speech enhancement techniques deployed on various devices. Multi-microphone speech enhancement problem is often decomposed into two decoupled steps: a beamformer that provides spatial filtering and a single-channel speech enhancement model that cleans up the beamformer output. In this work, we propose a speech enhancement solution that takes both the raw microphone and beamformer outputs as the input for an ML model. We devise a simple yet effective training scheme that allows the model to learn from the cues of the beamformer by contrasting the two inputs and greatly boost its capability in spatial rejection, while conducting the general tasks of denoising and dereverberation. The proposed solution takes advantage of classical spatial filtering algorithms instead of competing with them. By design, the beamformer module then could be selected separately and does not require a large amount of data to be optimized for a given form factor, and the network model can be considered as a standalone module which is highly transferable independently from the microphone array. We name the ML module in our solution as GSENet, short for Guided Speech Enhancement Network. We demonstrate its effectiveness on real world data collected on multi-microphone devices in terms of the suppression of noise and interfering speech.

翻译：为语音通信和人类计算机界面的原因,对高质量语音捕捉进行了广泛的研究。为了改进捕捉性能,我们常常可以找到在各种装置上部署的多声扩音技术。多声扩音问题往往被分解成两个分解的步骤:一个提供空间过滤器的波束装置和一个清理波束输出的单声道扩音模型。在这项工作中,我们提出了一个语音增强解决方案,将原始麦克风和波束输出作为ML模型的输入。我们设计了一个简单而有效的培训计划,使模型能够通过对比两种输入并大大提升其在空间拒绝方面的能力,同时进行拆音和皮肤变异的一般任务。拟议解决方案利用经典空间过滤算法而不是与它们竞争。通过设计,然后可以单独选择波束模模模模模模块,而不需要大量的数据来优化特定的形式要素。我们可以将网络模型视为一个独立的独立模块,该模块在空间阻断方面可以高度可转让,同时进行空间阻断,同时进行空间阻断和降低空间阻断能力。我们收集的GS-LM-L 将数据定位模块用于真正的磁感应系统。</s>

0

相关内容

语音增强

语音增强是指当语音信号被各种各样的噪声干扰、甚至淹没后，从噪声背景中提取有用的语音信号，抑制、降低噪声干扰的技术。一句话，从含噪语音中提取尽可能纯净的原始语音。

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

随机机械系统的建模和控制问题

国家自然科学基金

1+阅读 · 2015年12月31日

助剂修饰增强银基光催化材料的稳定性与光催化活性

国家自然科学基金

0+阅读 · 2014年12月31日

NET基因启动子区DNA甲基化及组蛋白修饰在抑郁症与高血压相关性中的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于动态全光调控表面等离激元的宽场超分辨光学显微成像研究

国家自然科学基金

0+阅读 · 2013年12月31日

高频强迫振动和自激振动耦合型摩擦驱动机理及其在超声电机中的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

超磁致伸缩材料的非线性动力学及驱动器控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

表面结构的多尺度融合测量方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于PMN-PT单晶的层状结构中弹性波传播特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

航空发动机疲劳寿命预测及故障诊断研究

国家自然科学基金

5+阅读 · 2008年12月31日

Investigating the Properties of Neural Network Representations in Reinforcement Learning

Arxiv

0+阅读 · 2023年5月5日

Rethinking Population-assisted Off-policy Reinforcement Learning

Arxiv

0+阅读 · 2023年5月4日

Towards Complex Document Understanding By Discrete Reasoning

Arxiv

0+阅读 · 2023年5月4日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement Learning

Arxiv

12+阅读 · 2021年12月28日

Learning Heuristics over Large Graphs via Deep Reinforcement Learning

Arxiv

12+阅读 · 2019年3月8日

Phase-aware Speech Enhancement with Deep Complex U-Net

Phase-aware Speech Enhancement with Deep Complex U-Net

Arxiv

15+阅读 · 2019年3月7日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Arxiv

16+阅读 · 2018年1月31日

Distance-based Self-Attention Network for Natural Language Inference

Arxiv

10+阅读 · 2017年12月6日

VIP会员

文章信息

相关主题

相互独立的

相关VIP内容

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向具身智能的多传感器融合感知综述：背景、方法、挑战与前景

《基于深度学习模型的图像军事目标检测》

TKDE | 推荐系统鲁棒性全面综述及鲁棒性评测库

中文版 | 战场创新：以色列-伊朗与俄罗斯-乌克兰战场如何重塑现代战争

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】全卷积语义分割综述

【推荐】全卷积语义分割综述

机器学习研究会

19+阅读 · 2017年8月31日

【推荐】图像分类必读开创性论文汇总

【推荐】图像分类必读开创性论文汇总

机器学习研究会

14+阅读 · 2017年8月15日

相关论文

Investigating the Properties of Neural Network Representations in Reinforcement Learning

Arxiv

0+阅读 · 2023年5月5日

Rethinking Population-assisted Off-policy Reinforcement Learning

Arxiv

0+阅读 · 2023年5月4日

Towards Complex Document Understanding By Discrete Reasoning

Arxiv

0+阅读 · 2023年5月4日

Pretraining in Deep Reinforcement Learning: A Survey

Arxiv

21+阅读 · 2022年11月8日

Exploiting Fine-grained Face Forgery Clues via Progressive Enhancement Learning

Arxiv

12+阅读 · 2021年12月28日

Learning Heuristics over Large Graphs via Deep Reinforcement Learning

Arxiv

12+阅读 · 2019年3月8日

Phase-aware Speech Enhancement with Deep Complex U-Net

Phase-aware Speech Enhancement with Deep Complex U-Net

Arxiv

15+阅读 · 2019年3月7日

Deep Reinforcement Learning: An Overview

Arxiv

15+阅读 · 2018年6月23日

Reinforced Self-Attention Network: a Hybrid of Hard and Soft Attention for Sequence Modeling

Arxiv

16+阅读 · 2018年1月31日

Distance-based Self-Attention Network for Natural Language Inference

Arxiv

10+阅读 · 2017年12月6日

相关基金

随机机械系统的建模和控制问题

国家自然科学基金

1+阅读 · 2015年12月31日

助剂修饰增强银基光催化材料的稳定性与光催化活性

国家自然科学基金

0+阅读 · 2014年12月31日

NET基因启动子区DNA甲基化及组蛋白修饰在抑郁症与高血压相关性中的作用机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于动态全光调控表面等离激元的宽场超分辨光学显微成像研究

国家自然科学基金

0+阅读 · 2013年12月31日

高频强迫振动和自激振动耦合型摩擦驱动机理及其在超声电机中的应用研究

国家自然科学基金

0+阅读 · 2012年12月31日

超磁致伸缩材料的非线性动力学及驱动器控制研究

国家自然科学基金

0+阅读 · 2012年12月31日

表面结构的多尺度融合测量方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于PMN-PT单晶的层状结构中弹性波传播特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

航空发动机疲劳寿命预测及故障诊断研究

国家自然科学基金

5+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员