Recent deep learning models have achieved high performance in speech enhancement; however, obtaining a fast, low-complexity model without significant performance degradation remains challenging. Previous knowledge distillation studies on speech enhancement could not solve this problem because their output-distillation methods are ill-suited to the speech enhancement task. In this study, we propose multi-view attention transfer (MV-AT), a feature-based distillation method, to obtain efficient speech enhancement models in the time domain. Based on a multi-view feature extraction model, MV-AT transfers the multi-view knowledge of the teacher network to the student network without additional parameters. Experimental results show that the proposed method consistently improved the performance of student models of various sizes on the Valentini and deep noise suppression (DNS) datasets. MANNER-S-8.1GF with our proposed method, a lightweight model for efficient deployment, requires 15.4x fewer parameters and 4.71x fewer floating-point operations (FLOPs), respectively, than the baseline model while achieving comparable performance.
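To make the feature-based distillation idea concrete, the following is a minimal PyTorch sketch of a generic attention-transfer loss in the style of Zagoruyko and Komodakis, applied to matched intermediate features of a time-domain teacher and student. It is an illustrative assumption, not the paper's exact MV-AT formulation: the specific "views" extracted by MV-AT are defined by the multi-view feature extraction model, whereas here each view is simply a matched feature map. Note that collapsing the channel dimension lets the student and teacher have different widths, which is consistent with transferring knowledge without additional parameters.

```python
import torch
import torch.nn.functional as F


def attention_map(feat: torch.Tensor) -> torch.Tensor:
    """Collapse a (batch, channels, time) feature map into a normalized
    per-timestep attention map, as in standard attention transfer."""
    att = feat.pow(2).mean(dim=1)        # (batch, time): channel-wise energy
    return F.normalize(att, p=2, dim=1)  # L2-normalize over time


def attention_transfer_loss(student_feats, teacher_feats):
    """Sum of L2 distances between normalized attention maps at each
    matched layer ("view"). Adds no trainable parameters."""
    return sum(
        F.mse_loss(attention_map(s), attention_map(t))
        for s, t in zip(student_feats, teacher_feats)
    )


# Toy usage: two matched views; channel widths may differ between networks,
# but the time resolution of each matched pair must agree.
student = [torch.randn(4, 32, 16000), torch.randn(4, 64, 8000)]
teacher = [torch.randn(4, 128, 16000), torch.randn(4, 256, 8000)]
loss = attention_transfer_loss(student, teacher)
```

In practice this distillation term would be added to the student's enhancement loss during training; the hyperparameters and the exact set of views follow the paper rather than this sketch.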