改进语音识别室电动反应估计 (Towards Improved Room Impulse Response Estimation for Speech Recognition) - 专知论文

会员服务 ·

0

估计/估计量 · 语音识别 · Reverberation · state-of-the-art · 自动语音识别 ·

2022 年 11 月 8 日

Towards Improved Room Impulse Response Estimation for Speech Recognition

翻译：改进语音识别室电动反应估计

Anton Ratnarajah,Ishwarya Ananthabhotla,Vamsi Krishna Ithapu,Pablo Hoffmann,Dinesh Manocha,Paul Calamia

We propose to characterize and improve the performance of blind room impulse response (RIR) estimation systems in the context of a downstream application scenario, far-field automatic speech recognition (ASR). We first draw the connection between improved RIR estimation and improved ASR performance, as a means of evaluating neural RIR estimators. We then propose a GAN-based architecture that encodes RIR features from reverberant speech and constructs an RIR from the encoded features, and uses a novel energy decay relief loss to optimize for capturing energy-based properties of the input reverberant speech. We show that our model outperforms the state-of-the-art baselines on acoustic benchmarks (by 72% on the energy decay relief and 22% on an early-reflection energy metric), as well as in an ASR evaluation task (by 6.9% in word error rate).

翻译：我们提议在下游应用情景(远野自动语音识别(ASR))中描述和改进盲室冲动反应估计系统(RIR)的性能。我们首先将改进RIR估计与改进ASR性能挂钩,以此作为评价神经RIR测算器的一种手段。然后我们提议一个基于GAN的架构,将RIR的性能从回声中编码,并根据编码特征构建RIR,并使用新的能源衰减救济损失优化,以捕捉输入反动词的能量特性。我们显示,我们的模型优于声学基准的最新基线(以72%的能量衰减率和22%的早期反射能度衡量标准),以及ASR的评估工作(以6.9%,文字误差率)。

0

相关内容

估计/估计量

估计/估计量

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

IRF5诱导小胶质细胞极化在脑缺血后神经元损伤中的作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

有理映射的参数空间

国家自然科学基金

0+阅读 · 2013年12月31日

热障涂层TGO与热疲劳损伤的激光超声无损检测机理与关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

LNPEP基因与汉族人银屑病发病机制相关性研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于整合素信号通路探讨补肾填精法治疗慢性再生障碍性贫血的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

Bayesian Generalized Kernel Inference for Exploration of Autonomous Robots

Arxiv

0+阅读 · 2023年1月2日

Multi-modal deep learning system for depression and anxiety detection

Arxiv

0+阅读 · 2022年12月30日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Arxiv

15+阅读 · 2020年3月31日

Feature Denoising for Improving Adversarial Robustness

Feature Denoising for Improving Adversarial Robustness

Arxiv

15+阅读 · 2018年12月9日

VIP会员

文章信息

相关主题

估计/估计量

state-of-the-art

自动语音识别

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

热门VIP内容

开通专知VIP会员享更多权益服务

【书籍】从零开始构建文本生成图像生成器：基于 Transformers 与扩散模型

人工智能与未来指挥

【伯克利博士论文】将大语言模型绑定至虚拟人格：实现人类行为模拟

稀疏自编码器综述：解释大语言模型的内部机制

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

相关论文

Bayesian Generalized Kernel Inference for Exploration of Autonomous Robots

Arxiv

0+阅读 · 2023年1月2日

Multi-modal deep learning system for depression and anxiety detection

Arxiv

0+阅读 · 2022年12月30日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Arxiv

15+阅读 · 2020年3月31日

Feature Denoising for Improving Adversarial Robustness

Feature Denoising for Improving Adversarial Robustness

Arxiv

15+阅读 · 2018年12月9日

相关基金

IRF5诱导小胶质细胞极化在脑缺血后神经元损伤中的作用及机制

国家自然科学基金

0+阅读 · 2014年12月31日

有理映射的参数空间

国家自然科学基金

0+阅读 · 2013年12月31日

热障涂层TGO与热疲劳损伤的激光超声无损检测机理与关键技术研究

国家自然科学基金

0+阅读 · 2012年12月31日

LNPEP基因与汉族人银屑病发病机制相关性研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于整合素信号通路探讨补肾填精法治疗慢性再生障碍性贫血的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员