为什么自我支持的语音确认学习会给发言人带来什么好处呢? (Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?) - 专知论文

会员服务 ·

0

声纹识别 · 学成 · SSL · 语音识别 · Performer ·

2022 年 4 月 27 日

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?

翻译：为什么自我支持的语音确认学习会给发言人带来什么好处呢?

Sanyuan Chen,Yu Wu,Chengyi Wang,Shujie Liu,Zhuo Chen,Peidong Wang,Gang Liu,Jinyu Li,Jian Wu,Xiangzhan Yu,Furu Wei

from arxiv, Submitted to INTERSPEECH 2022

Recently, self-supervised learning (SSL) has demonstrated strong performance in speaker recognition, even if the pre-training objective is designed for speech recognition. In this paper, we study which factor leads to the success of self-supervised learning on speaker-related tasks, e.g. speaker verification (SV), through a series of carefully designed experiments. Our empirical results on the Voxceleb-1 dataset suggest that the benefit of SSL to SV task is from a combination of mask speech prediction loss, data scale, and model size, while the SSL quantizer has a minor impact. We further employ the integrated gradients attribution method and loss landscape visualization to understand the effectiveness of self-supervised learning for speaker recognition performance.

翻译：最近,自我监督的学习(SSL)在语音识别方面表现良好,即使培训前的目标是为语音识别设计的,我们也在本文中研究,通过一系列精心设计的实验,使自我监督的语音相关任务(如语音校验(SV))学习取得成功的因素是什么。我们在Voxceleb-1数据集上的经验结果表明,SSL对SV任务的好处在于将面具语音预测损失、数据规模和模型大小结合起来,而SSL量化工具的影响较小。我们还进一步采用综合梯度归属法和损失景观可视化来理解自我监督的语音识别学习效果的效果。

0

相关内容

声纹识别

说话人识别（Speaker Recognition），或者称为声纹识别（Voiceprint Recognition, VPR），是根据语音中所包含的说话人个性信息，利用计算机以及现在的信息识别技术，自动鉴别说话人身份的一种生物特征识别技术。说话人识别研究的目的就是从语音中提取具有说话人表征性的特征，建立有效的模型和系统，实现自动精准的说话人鉴别。

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

专知会员服务

60+阅读 · 2019年12月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

基于双树复小波的单幅遥感图像自适应去除云雾原理与方法的研究

国家自然科学基金

0+阅读 · 2014年12月31日

有限温度下位错的芯结构与Perierls应力的研究

国家自然科学基金

0+阅读 · 2013年12月31日

二向性反射分布函数的先验知识耦合式融合方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

候选lncRNAs调控脂肪干细胞成髓核分化的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-146a靶向IRAK1与TRAF6调控非小细胞肺癌转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

NiMnSnCo磁制冷材料快速凝固过程及微观结构研究

国家自然科学基金

0+阅读 · 2012年12月31日

Tecto调节非洲爪蛙胚层决定与分化的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

剧烈塑性变形条件下金属间化合物相变研究

国家自然科学基金

0+阅读 · 2012年12月31日

Nanostrands－膨胀石墨电磁屏蔽材料制备及性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于涡流发生器和合成射流技术的高超声速进气道流动控制方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

Self-Supervised Speech Representation Learning: A Review

Self-Supervised Speech Representation Learning: A Review

Arxiv

0+阅读 · 2022年6月15日

ReCo: Retrieve and Co-segment for Zero-shot Transfer

Arxiv

0+阅读 · 2022年6月14日

Self-supervised Vision Transformers for Joint SAR-optical Representation Learning

Self-supervised Vision Transformers for Joint SAR-optical Representation Learning

Arxiv

0+阅读 · 2022年6月14日

Content Popularity Prediction in Fog-RANs: A Clustered Federated Learning Based Approach

Arxiv

0+阅读 · 2022年6月13日

Contrastive and Non-Contrastive Self-Supervised Learning Recover Global and Local Spectral Embedding Methods

Contrastive and Non-Contrastive Self-Supervised Learning Recover Global and Local Spectral Embedding Methods

Arxiv

1+阅读 · 2022年6月10日

Exploring Feature Self-relation for Self-supervised Transformer

Exploring Feature Self-relation for Self-supervised Transformer

Arxiv

0+阅读 · 2022年6月10日

Multi-task Self-distillation for Graph-based Semi-Supervised Learning

Arxiv

0+阅读 · 2022年6月10日

Self-Supervised Learning for Recommender Systems: A Survey

Arxiv

12+阅读 · 2022年3月29日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Self-supervised Learning: Generative or Contrastive

Arxiv

25+阅读 · 2021年3月20日

VIP会员

文章信息

相关主题

相关VIP内容

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

近期必读的6篇 NeurIPS 2019 的零样本学习(Zero-Shot Learning)论文

专知会员服务

60+阅读 · 2019年12月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

最新《扩散模型原理》新书，470页pdf

无人机作战：演进、创新与未来战场

AI 智能体简史

多模态空间推理在大模型时代：综述与基准测试

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

强化学习三篇论文避免遗忘等

强化学习三篇论文避免遗忘等

CreateAMind

20+阅读 · 2019年5月24日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Self-Supervised Speech Representation Learning: A Review

Self-Supervised Speech Representation Learning: A Review

Arxiv

0+阅读 · 2022年6月15日

ReCo: Retrieve and Co-segment for Zero-shot Transfer

Arxiv

0+阅读 · 2022年6月14日

Self-supervised Vision Transformers for Joint SAR-optical Representation Learning

Self-supervised Vision Transformers for Joint SAR-optical Representation Learning

Arxiv

0+阅读 · 2022年6月14日

Content Popularity Prediction in Fog-RANs: A Clustered Federated Learning Based Approach

Arxiv

0+阅读 · 2022年6月13日

Contrastive and Non-Contrastive Self-Supervised Learning Recover Global and Local Spectral Embedding Methods

Contrastive and Non-Contrastive Self-Supervised Learning Recover Global and Local Spectral Embedding Methods

Arxiv

1+阅读 · 2022年6月10日

Exploring Feature Self-relation for Self-supervised Transformer

Exploring Feature Self-relation for Self-supervised Transformer

Arxiv

0+阅读 · 2022年6月10日

Multi-task Self-distillation for Graph-based Semi-Supervised Learning

Arxiv

0+阅读 · 2022年6月10日

Self-Supervised Learning for Recommender Systems: A Survey

Arxiv

12+阅读 · 2022年3月29日

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

Arxiv

18+阅读 · 2021年4月4日

Self-supervised Learning: Generative or Contrastive

Arxiv

25+阅读 · 2021年3月20日

相关基金

基于双树复小波的单幅遥感图像自适应去除云雾原理与方法的研究

国家自然科学基金

0+阅读 · 2014年12月31日

有限温度下位错的芯结构与Perierls应力的研究

国家自然科学基金

0+阅读 · 2013年12月31日

二向性反射分布函数的先验知识耦合式融合方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

候选lncRNAs调控脂肪干细胞成髓核分化的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

miR-146a靶向IRAK1与TRAF6调控非小细胞肺癌转移的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

NiMnSnCo磁制冷材料快速凝固过程及微观结构研究

国家自然科学基金

0+阅读 · 2012年12月31日

Tecto调节非洲爪蛙胚层决定与分化的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

剧烈塑性变形条件下金属间化合物相变研究

国家自然科学基金

0+阅读 · 2012年12月31日

Nanostrands－膨胀石墨电磁屏蔽材料制备及性能研究

国家自然科学基金

0+阅读 · 2009年12月31日

基于涡流发生器和合成射流技术的高超声速进气道流动控制方法研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员