Recent work in spoken language modeling has shown that a language model can be learned from raw audio without any text labels. The approach relies on first transforming the audio into a sequence of discrete units (or pseudo-text) and then training a language model directly on that pseudo-text. Is such a discrete bottleneck necessary, given that it can introduce irreversible errors in the encoding of the speech signal, or could we learn a language model without discrete units at all? In this work, we study the role of discrete versus continuous representations in spoken language modeling. We show that discretization is indeed essential for good results in spoken language modeling: it removes linguistically irrelevant information from the continuous features, which improves language modeling performance. On the basis of this study, we train a language model on discrete units derived from HuBERT features, reaching new state-of-the-art results on the lexical, syntactic, and semantic metrics of the Zero Resource Speech Challenge 2021 (Track 1 - Speech Only).