While permutation invariant training (PIT) based continuous speech separation (CSS) significantly improves conversation transcription accuracy, it often suffers from speech leakage and separation failures in "hot spot" regions because it has a fixed number of output channels. In this paper, we propose to apply the recurrent selective attention network (RSAN) to CSS, which generates a variable number of output channels based on active speaker counting. In addition, we propose a novel block-wise dependency extension of RSAN that introduces dependencies between adjacent processing blocks in the CSS framework, enabling the network to exploit the separation results of previous blocks to facilitate processing of the current block. Experimental results on the LibriCSS dataset show that the RSAN-based CSS (RSAN-CSS) network consistently improves speech recognition accuracy over PIT-based models. The proposed block-wise dependency modeling further boosts the performance of RSAN-CSS.
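To make the block-wise dependency idea concrete, the following is a minimal sketch, not the authors' implementation: `rsan_extract` is a hypothetical stand-in for one trained RSAN extraction pass, and all names and shapes are illustrative assumptions. The loop shows how each block emits a variable number of sources (driven by a per-block speaker count) while receiving the previous block's separation results as additional context.

```python
import numpy as np

def rsan_extract(block, residual_mask, prev_sources):
    """Hypothetical placeholder for one RSAN extraction pass.
    A real RSAN is a trained neural network; this stub only
    illustrates the interface. prev_sources (the previous block's
    outputs) is ignored here but would condition a real network."""
    source = block * residual_mask                     # pretend separation
    residual_mask = np.clip(residual_mask - 0.5, 0.0, 1.0)
    return source, residual_mask

def css_blockwise(blocks, speaker_counts):
    """Block-wise CSS loop with cross-block dependency: each block is
    separated into a variable number of sources, and the sources from
    the previous block are passed along so the current block can
    reuse them."""
    prev_sources, outputs = [], []
    for block, n_speakers in zip(blocks, speaker_counts):
        residual_mask = np.ones_like(block)
        sources = []
        for _ in range(n_speakers):                    # variable output channels
            src, residual_mask = rsan_extract(block, residual_mask, prev_sources)
            sources.append(src)
        outputs.append(sources)
        prev_sources = sources                         # dependency carried forward
    return outputs

# Toy usage: three signal blocks with 2, 1, and 3 active speakers.
blocks = [np.random.randn(100) for _ in range(3)]
separated = css_blockwise(blocks, speaker_counts=[2, 1, 3])
```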