Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation - 专知论文

会员服务 ·

0

Reverberation · Performer · 估计/估计量 · MoDELS · 自动语音识别 ·

2023 年 5 月 25 日

Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation

翻译：暂无翻译

Marvin Lavechin,Marianne Métais,Hadrien Titeux,Alodie Boissonnet,Jade Copet,Morgane Rivière,Elika Bergelson,Alejandrina Cristia,Emmanuel Dupoux,Hervé Bredin

Most automatic speech processing systems register degraded performance when applied to noisy or reverberant speech. But how can one tell whether speech is noisy or reverberant? We propose Brouhaha, a neural network jointly trained to extract speech/non-speech segments, speech-to-noise ratios, and C50room acoustics from single-channel recordings. Brouhaha is trained using a data-driven approach in which noisy and reverberant audio segments are synthesized. We first evaluate its performance and demonstrate that the proposed multi-task regime is beneficial. We then present two scenarios illustrating how Brouhaha can be used on naturally noisy and reverberant data: 1) to investigate the errors made by a speaker diarization model (pyannote.audio); and 2) to assess the reliability of an automatic speech recognition model (Whisper from OpenAI). Both our pipeline and a pretrained model are open source and shared with the speech community.

翻译：暂无翻译

0

相关内容

Reverberation

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

CHCHD10在双酚A致精子线粒体氧化磷酸化障碍中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Cdk5介导的Drp1磷酸化对线粒体的调控及其在阿尔茨海默病中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

鱼类清道夫受体识别和调控细菌炎症信号的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

MIMO检测与合并中的智能信号处理研究

国家自然科学基金

0+阅读 · 2009年12月31日

Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

Arxiv

0+阅读 · 2023年7月13日

Automated Deception Detection from Videos: Using End-to-End Learning Based High-Level Features and Classification Approaches

Arxiv

0+阅读 · 2023年7月13日

On Collaboration in Distributed Parameter Estimation with Resource Constraints

Arxiv

0+阅读 · 2023年7月12日

Robust scalable initialization for Bayesian variational inference with multi-modal Laplace approximations

Arxiv

0+阅读 · 2023年7月12日

Hate Speech Detection via Dual Contrastive Learning

Arxiv

0+阅读 · 2023年7月10日

VIP会员

文章信息

相关主题

估计/估计量

自动语音识别

相关VIP内容

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【书籍】从零开始构建文本生成图像生成器：基于 Transformers 与扩散模型

人工智能与未来指挥

【伯克利博士论文】将大语言模型绑定至虚拟人格：实现人类行为模拟

稀疏自编码器综述：解释大语言模型的内部机制

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

相关论文

Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

Arxiv

0+阅读 · 2023年7月13日

Automated Deception Detection from Videos: Using End-to-End Learning Based High-Level Features and Classification Approaches

Arxiv

0+阅读 · 2023年7月13日

On Collaboration in Distributed Parameter Estimation with Resource Constraints

Arxiv

0+阅读 · 2023年7月12日

Robust scalable initialization for Bayesian variational inference with multi-modal Laplace approximations

Arxiv

0+阅读 · 2023年7月12日

Hate Speech Detection via Dual Contrastive Learning

Arxiv

0+阅读 · 2023年7月10日

相关基金

CHCHD10在双酚A致精子线粒体氧化磷酸化障碍中的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Cdk5介导的Drp1磷酸化对线粒体的调控及其在阿尔茨海默病中的作用

国家自然科学基金

0+阅读 · 2012年12月31日

鱼类清道夫受体识别和调控细菌炎症信号的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于list-mode数据的快速SART真3D PET断层重建算法的研究

国家自然科学基金

0+阅读 · 2011年12月31日

MIMO检测与合并中的智能信号处理研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员