不同部分演讲时间通用语音识别编码器 (Universal speaker recognition encoders for different speech segments duration) - 专知论文

会员服务 ·

0

大学 · 声纹识别 · Networking · Neural Networks · 优化器 ·

2022 年 10 月 28 日

Universal speaker recognition encoders for different speech segments duration

翻译：不同部分演讲时间通用语音识别编码器

Sergey Novoselov,Vladimir Volokhov,Galina Lavrentyeva

from arxiv, Submitted to ICASSP'23

Creating universal speaker encoders which are robust for different acoustic and speech duration conditions is a big challenge today. According to our observations systems trained on short speech segments are optimal for short phrase speaker verification and systems trained on long segments are superior for long segments verification. A system trained simultaneously on pooled short and long speech segments does not give optimal verification results and usually degrades both for short and long segments. This paper addresses the problem of creating universal speaker encoders for different speech segments duration. We describe our simple recipe for training universal speaker encoder for any type of selected neural network architecture. According to our evaluation results of wav2vec-TDNN based systems obtained for NIST SRE and VoxCeleb1 benchmarks the proposed universal encoder provides speaker verification improvements in case of different enrollment and test speech segment duration. The key feature of the proposed encoder is that it has the same inference time as the selected neural network architecture.

翻译：创建通用的语音编码器对于不同的音响和语音持续时间条件来说都是一个巨大的挑战。根据我们在短语部分培训的观察系统,短语部分的最佳是短语部分的语音核查,长段部分培训的系统优于长段核查。同时培训短语和长语部分的系统不会产生最佳的核查结果,通常会降低短语段和长段的功能。本文件讨论了为不同语言部分创建通用语音编码器的问题。我们描述了为任何类型的选定的神经网络结构培训通用语音编码器的简单方法。根据我们为 NIST SRE 和 VoxCeleb1 提供的基于 wav2vec-TDN 系统的评估结果,拟议的通用编码器在不同的录制和测试语音部分持续时间方面都提供了语音部分的改进。拟议的编码器的关键特征是,它具有与选定的神经网络结构相同的推论时间。

0

相关内容

人类接受高层次教育、进行原创性研究的场所。现在的大学一般包括一个能授予硕士和博士学位的研究生院和数个专业学院，以及能授予学士学位的一个本科生院。大学还包括高等专科学校

【超赞的#C++#速查&信息图】“hacking c++ - Cheat Sheets & Infographics”

【超赞的#C++#速查&信息图】“hacking c++ - Cheat Sheets & Infographics”

专知会员服务

30+阅读 · 2022年3月8日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

多分析方法与多样本整合的代谢组学研究食管癌淋巴结转移生物标志物

国家自然科学基金

0+阅读 · 2015年12月31日

聚龙一号装置上钽在准等熵压加载下的强度特性研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-5591靶向AGER/ROS/JNK抑制MSCs氧化应激损伤在糖尿病创面修复中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

瘢痕疙瘩中DAB-1抑制E3连接酶SIAH1对TIEG1泛素化介导TGF-β/Smads信号通路的研究

国家自然科学基金

0+阅读 · 2014年12月31日

β2-AR/PKA通路在内皮祖细胞修复急性肾损伤中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Diversin介导非小细胞肺癌长春瑞滨耐药的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

PPARγ信号调制人前列腺癌细胞能量限制抑癌效应和自噬的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

AlSiC电子封装复合材料磨削用钎焊金刚石微刃砂轮的研究

国家自然科学基金

0+阅读 · 2012年12月31日

材料表面拓扑形貌的细胞响应

国家自然科学基金

0+阅读 · 2012年12月31日

Legumain在乳腺癌骨转移和破骨损伤过程中的作用机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Preventing RNN from Using Sequence Length as a Feature

Arxiv

0+阅读 · 2022年12月16日

The SPEC-RG Reference Architecture for the Edge Continuum

Arxiv

0+阅读 · 2022年12月15日

Multimodal Teacher Forcing for Reconstructing Nonlinear Dynamical Systems

Arxiv

0+阅读 · 2022年12月15日

IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages

Arxiv

0+阅读 · 2022年12月15日

Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation

Arxiv

0+阅读 · 2022年12月15日

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Arxiv

1+阅读 · 2022年12月14日

Universal Densities Exist for Every Finite Reference Measure

Arxiv

0+阅读 · 2022年12月14日

Mitigating Artifacts in Real-World Video Super-Resolution Models

Mitigating Artifacts in Real-World Video Super-Resolution Models

Arxiv

0+阅读 · 2022年12月14日

Understanding Diffusion Models: A Unified Perspective

Arxiv

14+阅读 · 2022年8月25日

How to represent part-whole hierarchies in a neural network

Arxiv

13+阅读 · 2021年2月25日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

【超赞的#C++#速查&信息图】“hacking c++ - Cheat Sheets & Infographics”

【超赞的#C++#速查&信息图】“hacking c++ - Cheat Sheets & Infographics”

专知会员服务

30+阅读 · 2022年3月8日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

不确定环境下无人机三维路径规划研究 | 221页

远征作战军事后勤规划

大语言模型将如何改变军事指挥结构

美陆军能力集成与开发系统（ACIDS）流程指南 | 2025最新122页

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Preventing RNN from Using Sequence Length as a Feature

Arxiv

0+阅读 · 2022年12月16日

The SPEC-RG Reference Architecture for the Edge Continuum

Arxiv

0+阅读 · 2022年12月15日

Multimodal Teacher Forcing for Reconstructing Nonlinear Dynamical Systems

Arxiv

0+阅读 · 2022年12月15日

IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages

Arxiv

0+阅读 · 2022年12月15日

Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation

Arxiv

0+阅读 · 2022年12月15日

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Arxiv

1+阅读 · 2022年12月14日

Universal Densities Exist for Every Finite Reference Measure

Arxiv

0+阅读 · 2022年12月14日

Mitigating Artifacts in Real-World Video Super-Resolution Models

Mitigating Artifacts in Real-World Video Super-Resolution Models

Arxiv

0+阅读 · 2022年12月14日

Understanding Diffusion Models: A Unified Perspective

Arxiv

14+阅读 · 2022年8月25日

How to represent part-whole hierarchies in a neural network

Arxiv

13+阅读 · 2021年2月25日

相关基金

多分析方法与多样本整合的代谢组学研究食管癌淋巴结转移生物标志物

国家自然科学基金

0+阅读 · 2015年12月31日

聚龙一号装置上钽在准等熵压加载下的强度特性研究

国家自然科学基金

0+阅读 · 2015年12月31日

miR-5591靶向AGER/ROS/JNK抑制MSCs氧化应激损伤在糖尿病创面修复中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

瘢痕疙瘩中DAB-1抑制E3连接酶SIAH1对TIEG1泛素化介导TGF-β/Smads信号通路的研究

国家自然科学基金

0+阅读 · 2014年12月31日

β2-AR/PKA通路在内皮祖细胞修复急性肾损伤中的作用及机制

国家自然科学基金

0+阅读 · 2013年12月31日

Diversin介导非小细胞肺癌长春瑞滨耐药的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

PPARγ信号调制人前列腺癌细胞能量限制抑癌效应和自噬的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

AlSiC电子封装复合材料磨削用钎焊金刚石微刃砂轮的研究

国家自然科学基金

0+阅读 · 2012年12月31日

材料表面拓扑形貌的细胞响应

国家自然科学基金

0+阅读 · 2012年12月31日

Legumain在乳腺癌骨转移和破骨损伤过程中的作用机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员