SG-VAD: 存储门语音活动探测 (SG-VAD: Stochastic Gates Based Speech Activity Detection) - 专知论文

会员服务 ·

0

可辨认的 · MoDELS · 无关特征 · HTTPS · 回合 ·

2022 年 10 月 28 日

SG-VAD: Stochastic Gates Based Speech Activity Detection

翻译：SG-VAD: 存储门语音活动探测

Jonathan Svirsky,Ofir Lindenbaum

We propose a novel voice activity detection (VAD) model in a low-resource environment. Our key idea is to model VAD as a denoising task, and construct a network that is designed to identify nuisance features for a speech classification task. We train the model to simultaneously identify irrelevant features while predicting the type of speech event. Our model contains only 7.8K parameters, outperforms the previously proposed methods on the AVA-Speech evaluation set, and provides comparative results on the HAVIC dataset. We present its architecture, experimental results, and ablation study on the model's components. We publish the code and the models here https://www.github.com/jsvir/vad.

翻译：我们提议在低资源环境中采用新型语音活动检测模式。我们的关键想法是将 VAD 模型作为拆卸任务,并建立一个旨在识别语言分类任务的骚扰特征的网络。我们训练模型,同时识别不相干特征,同时预测演讲事件的类型。我们的模型只包含7.8K参数,优于AVA-Speech 评估集中先前建议的方法,并提供关于HAVIC数据集的比较结果。我们展示了其结构、实验结果和模型组成部分的反差研究。我们在这里公布了代码和模型 https://www.github.com/jsvir/vad。

0

相关内容

可辨认的

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

去泛素化酶USP4调节SMAD4蛋白单泛素化并调控TGF-β/Activin信号的研究

国家自然科学基金

0+阅读 · 2014年12月31日

PCV2感染猪肺泡巨噬细胞自噬过程中miRNA差异表达谱及靶基因功能调控网络研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

乙型肝炎性肝癌性别差异的假基因活化与长链非编码RNA调控网络及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

Parameter estimation of the homodyned K distribution based on neural networks and trainable fractional-order moments

Arxiv

0+阅读 · 2022年12月16日

An Empirical Study of Deep Learning Models for Vulnerability Detection

Arxiv

0+阅读 · 2022年12月15日

Stochastic Zeroth order Descent with Structured Directions

Arxiv

0+阅读 · 2022年12月15日

Detection of False Data Injection Attacks Using the Autoencoder Approach

Arxiv

0+阅读 · 2022年12月14日

Bayesian Spectral Deconvolution Based on Binominal Distribution

Arxiv

1+阅读 · 2022年12月14日

VIP会员

文章信息

相关主题

相关VIP内容

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACML2025教程】迈向鲁棒且可信的大语言模型：问题与缓解策略

《利用人工智能改善军事警察行动：当下现状探索》最新95页报告

Google《AI智能体企业应用手册报告》，46页pdf

面向现代武装力量的高级AI驱动军事模拟与训练软件

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

相关论文

Parameter estimation of the homodyned K distribution based on neural networks and trainable fractional-order moments

Arxiv

0+阅读 · 2022年12月16日

An Empirical Study of Deep Learning Models for Vulnerability Detection

Arxiv

0+阅读 · 2022年12月15日

Stochastic Zeroth order Descent with Structured Directions

Arxiv

0+阅读 · 2022年12月15日

Detection of False Data Injection Attacks Using the Autoencoder Approach

Arxiv

0+阅读 · 2022年12月14日

Bayesian Spectral Deconvolution Based on Binominal Distribution

Arxiv

1+阅读 · 2022年12月14日

相关基金

去泛素化酶USP4调节SMAD4蛋白单泛素化并调控TGF-β/Activin信号的研究

国家自然科学基金

0+阅读 · 2014年12月31日

PCV2感染猪肺泡巨噬细胞自噬过程中miRNA差异表达谱及靶基因功能调控网络研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

乙型肝炎性肝癌性别差异的假基因活化与长链非编码RNA调控网络及功能研究

国家自然科学基金

0+阅读 · 2013年12月31日

DEC1、DEC2对人乳腺癌细胞衰老的调控作用及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

微信扫码咨询专知VIP会员