The adoption of advanced deep learning (DL) architectures in stuttering detection (SD) tasks is challenging due to the limited size of the available datasets. To this end, this work introduces the application of speech embeddings extracted with pre-trained deep models trained on massive audio datasets for different tasks. In particular, we explore audio representations obtained using the emphasized channel attention, propagation, and aggregation time-delay neural network (ECAPA-TDNN) and the Wav2Vec2.0 model, trained on the VoxCeleb and LibriSpeech datasets, respectively. After extracting the embeddings, we benchmark several traditional classifiers, such as k-nearest neighbor, Gaussian naive Bayes, and a neural network, on the stuttering detection task. In comparison to a standard SD system trained only on the limited SEP-28k dataset, we obtain a relative improvement of 16.74% in overall accuracy over the baseline. Finally, we show that combining the two embeddings and concatenating multiple layers of Wav2Vec2.0 can further improve SD performance by up to 1% and 2.64%, respectively.
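The pipeline described above (pre-trained embeddings, optionally concatenated, fed to a traditional classifier) can be sketched as follows. This is a minimal illustration, not the paper's implementation: the random vectors below are hypothetical stand-ins for ECAPA-TDNN embeddings (typically 192-dim) and Wav2Vec2.0 embeddings (768-dim per layer), and the k value and data sizes are arbitrary choices for the sketch.

```python
import numpy as np

rng = np.random.default_rng(0)

def fake_embeddings(n, dim, label):
    # Hypothetical stand-in for pre-trained embeddings: random vectors
    # with a class-dependent mean simulate fluent (0) vs. stuttered (1)
    # clips. In the actual system these would come from ECAPA-TDNN and
    # Wav2Vec2.0 forward passes over the SEP-28k audio.
    return rng.normal(loc=label * 0.5, scale=1.0, size=(n, dim))

ecapa = np.vstack([fake_embeddings(50, 192, 0), fake_embeddings(50, 192, 1)])
w2v2 = np.vstack([fake_embeddings(50, 768, 0), fake_embeddings(50, 768, 1)])
y = np.array([0] * 50 + [1] * 50)

# "Combining two embeddings" here means simple feature concatenation.
X = np.concatenate([ecapa, w2v2], axis=1)

def knn_predict(X_train, y_train, X_test, k=5):
    """Minimal k-nearest-neighbor classifier (Euclidean distance)."""
    preds = []
    for x in X_test:
        dists = np.linalg.norm(X_train - x, axis=1)
        nearest = y_train[np.argsort(dists)[:k]]
        preds.append(np.bincount(nearest).argmax())
    return np.array(preds)

# Quick train/test split for a sanity check on the synthetic data.
idx = rng.permutation(len(y))
train, test = idx[:80], idx[80:]
acc = (knn_predict(X[train], y[train], X[test]) == y[test]).mean()
print(f"k-NN accuracy on synthetic concatenated embeddings: {acc:.2f}")
```

Concatenating multiple Wav2Vec2.0 layers would follow the same pattern: stack each layer's vector along the feature axis before classification.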