分析语音语言建模的隐蔽自我监督演讲代表 (Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling) - 专知论文

会员服务 ·

0

Analysis · 离散化 · 语言模型化 · 相关系数 · 音素 ·

2023 年 1 月 2 日

Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling

翻译：分析语音语言建模的隐蔽自我监督演讲代表

Amitay Sicherman,Yossi Adi

This work profoundly analyzes discrete self-supervised speech representations through the eyes of Generative Spoken Language Modeling (GSLM). Following the findings of such an analysis, we propose practical improvements to the discrete unit for the GSLM. First, we start comprehending these units by analyzing them in three axes: interpretation, visualization, and resynthesis. Our analysis finds a high correlation between the speech units to phonemes and phoneme families, while their correlation with speaker or gender is weaker. Additionally, we found redundancies in the extracted units and claim that one reason may be the units' context. Following this analysis, we propose a new, unsupervised metric to measure unit redundancies. Finally, we use this metric to develop new methods that improve the robustness of units clustering and show significant improvement considering zero-resource speech metrics such as ABX. Code and analysis tools are available under the following link.

翻译：这项工作深入分析了通过Generation Spoken语言建模(GSLM)的眼神,自我监督的单独语言表达方式。根据这一分析的结果,我们建议对GSL的离散单元进行实际改进。首先,我们开始从三个轴分析这些单元:解释、可视化和再合成。我们的分析发现,语音单位与电话和电话家庭之间有着高度的相互关系,而它们与语音或性别的关系则较弱。此外,我们发现抽取单元的冗余,并声称一个原因可能是单位的背景。在进行这一分析之后,我们提出了一个新的、不受监督的衡量单位冗余的衡量标准。最后,我们利用这一指标来制定新方法,提高单位集群的稳健性,并表明在考虑诸如ABX等零资源语言衡量标准时,有了显著的改进。代码和分析工具可以在以下链接下找到。

0

相关内容

Analysis

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

专知会员服务

90+阅读 · 2020年12月21日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

手性桥联双脒基稀土配合物的合成及催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

Maresin1 调控巨噬细胞表型转化在肺损伤中的作用及其机制

国家自然科学基金

0+阅读 · 2015年12月31日

内皮祖细胞源性微粒microRNA抑制内皮间质转化改善心脏纤维化的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

负性共刺激分子B7-H3与c-Met结合调控EMT促进结直肠癌的转移及机制

国家自然科学基金

0+阅读 · 2015年12月31日

蓖麻B3转录因子对脂肪酸羟基化酶的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

双靶点抑制c-met和VEGFR2治疗高侵袭性肝细胞癌及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

细胞和个体水平上Vaspin与胰岛素抵抗相互关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

TGF-β1通路调控MET在滑膜肉瘤双相分化和侵袭转移中作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

HGF、IGF通过DNMTs调控肝细胞肝癌恶性表型的DNA甲基化机制

国家自然科学基金

0+阅读 · 2011年12月31日

胰腺星形细胞对胰腺癌化疗耐药的影响及其机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

Understanding The Robustness of Self-supervised Learning Through Topic Modeling

Arxiv

0+阅读 · 2023年2月28日

Ensemble knowledge distillation of self-supervised speech models

Arxiv

0+阅读 · 2023年2月24日

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

Arxiv

11+阅读 · 2021年12月16日

Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling

Arxiv

13+阅读 · 2021年12月3日

Graph Self-Supervised Learning: A Survey

Arxiv

15+阅读 · 2021年8月5日

Cross-Modal Discrete Representation Learning

Arxiv

18+阅读 · 2021年6月10日

Graph Learning: A Survey

Arxiv

58+阅读 · 2021年5月3日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Self-correcting Q-Learning

Arxiv

11+阅读 · 2020年12月2日

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Arxiv

15+阅读 · 2020年3月31日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

专知会员服务

90+阅读 · 2020年12月21日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】在低维和高维空间中分析、建模和转换潜在表征

从无人机到数据：揭示边缘计算作为新作战域

可解释人工智能的基础

大规模视觉模型中的基于提示的适应：综述

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知

133+阅读 · 2020年3月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Understanding The Robustness of Self-supervised Learning Through Topic Modeling

Arxiv

0+阅读 · 2023年2月28日

Ensemble knowledge distillation of self-supervised speech models

Arxiv

0+阅读 · 2023年2月24日

Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation

Arxiv

11+阅读 · 2021年12月16日

Mind Your Clever Neighbours: Unsupervised Person Re-identification via Adaptive Clustering Relationship Modeling

Arxiv

13+阅读 · 2021年12月3日

Graph Self-Supervised Learning: A Survey

Arxiv

15+阅读 · 2021年8月5日

Cross-Modal Discrete Representation Learning

Arxiv

18+阅读 · 2021年6月10日

Graph Learning: A Survey

Arxiv

58+阅读 · 2021年5月3日

Spatially Consistent Representation Learning

Arxiv

14+阅读 · 2021年3月10日

Self-correcting Q-Learning

Arxiv

11+阅读 · 2020年12月2日

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Look-into-Object: Self-supervised Structure Modeling for Object Recognition

Arxiv

15+阅读 · 2020年3月31日

相关基金

手性桥联双脒基稀土配合物的合成及催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

Maresin1 调控巨噬细胞表型转化在肺损伤中的作用及其机制

国家自然科学基金

0+阅读 · 2015年12月31日

内皮祖细胞源性微粒microRNA抑制内皮间质转化改善心脏纤维化的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

负性共刺激分子B7-H3与c-Met结合调控EMT促进结直肠癌的转移及机制

国家自然科学基金

0+阅读 · 2015年12月31日

蓖麻B3转录因子对脂肪酸羟基化酶的调控机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

双靶点抑制c-met和VEGFR2治疗高侵袭性肝细胞癌及其机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

细胞和个体水平上Vaspin与胰岛素抵抗相互关系研究

国家自然科学基金

0+阅读 · 2012年12月31日

TGF-β1通路调控MET在滑膜肉瘤双相分化和侵袭转移中作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

HGF、IGF通过DNMTs调控肝细胞肝癌恶性表型的DNA甲基化机制

国家自然科学基金

0+阅读 · 2011年12月31日

胰腺星形细胞对胰腺癌化疗耐药的影响及其机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

微信扫码咨询专知VIP会员