句号MIM: 隐藏变量语言模式 (SentenceMIM: A Latent Variable Language Model) - 专知论文

会员服务 ·

0

INFORMS · 潜变量/隐变量 · 变分自编码 · 语言模型化 · 互信息 ·

2021 年 4 月 21 日

SentenceMIM: A Latent Variable Language Model

翻译：句号MIM: 隐藏变量语言模式

Micha Livne,Kevin Swersky,David J. Fleet

from arxiv, Preprint. Demo: https://github.com/seraphlabs-ca/SentenceMIM-demo

SentenceMIM is a probabilistic auto-encoder for language data, trained with Mutual Information Machine (MIM) learning to provide a fixed length representation of variable length language observations (i.e., similar to VAE). Previous attempts to learn VAEs for language data faced challenges due to posterior collapse. MIM learning encourages high mutual information between observations and latent variables, and is robust against posterior collapse. As such, it learns informative representations whose dimension can be an order of magnitude higher than existing language VAEs. Importantly, the SentenceMIM loss has no hyper-parameters, simplifying optimization. We compare sentenceMIM with VAE, and AE on multiple datasets. SentenceMIM yields excellent reconstruction, comparable to AEs, with a rich structured latent space, comparable to VAEs. The structured latent representation is demonstrated with interpolation between sentences of different lengths. We demonstrate the versatility of sentenceMIM by utilizing a trained model for question-answering and transfer learning, without fine-tuning, outperforming VAE and AE with similar architectures.

翻译：句号MIM是语言数据的一个概率自动编码器,受过相互信息机器(MIM)培训,以提供不同语言的固定长度表示不同语言的观察(即类似于VAE)。以前为语言数据而学习VAE的尝试由于后天崩溃而面临挑战。MIM学习鼓励观测和潜在变量之间的高度相互信息,并有力地防止后天崩溃。因此,它学会了信息化的表述,其范围可能比现有语言VAEs高出一个数量级。重要的是,句号MIM损失没有超度参数,简化了优化。我们在多个数据集上将句号MIM与VAE和AE作比较。句号与AE作对比,可以进行与AE相比的极佳的重建,结构化潜力空间与VAE相似。结构化的潜在空间与VAE相似,通过不同长度的句号之间的相互调而表现出了结构化的潜在代表性。我们通过使用经过培训的问答和转移学习模式来证明句号的多重性。我们用一个经过精细调的模型来证明MIM具有类似的结构。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

ICML 2021论文收录

ICML 2021论文收录

专知会员服务

123+阅读 · 2021年5月8日

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

专知会员服务

90+阅读 · 2020年12月21日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【斯坦福】机器学习优化简明导论， Introduction to Optimization for Machine Learning

【斯坦福】机器学习优化简明导论， Introduction to Optimization for Machine Learning

专知会员服务

93+阅读 · 2020年5月6日

【纽约大学】产生新的概念与混合神经符号模型，Generating new concepts with hybrid neuro-symbolic models

【纽约大学】产生新的概念与混合神经符号模型，Generating new concepts with hybrid neuro-symbolic models

专知会员服务

17+阅读 · 2020年3月23日

【AAAI Tutorials 2019】深度贝叶斯与序列学习（ Deep Bayesian and Sequential Learning）

【AAAI Tutorials 2019】深度贝叶斯与序列学习（ Deep Bayesian and Sequential Learning）

专知会员服务

72+阅读 · 2019年11月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

深度神经网络生成模型：从 GAN VAE 到 CVAE-GAN

深度神经网络生成模型：从 GAN VAE 到 CVAE-GAN

AI100

11+阅读 · 2017年7月20日

Model Selection for Bayesian Autoencoders

Arxiv

0+阅读 · 2021年6月11日

Score-based Generative Modeling in Latent Space

Arxiv

0+阅读 · 2021年6月10日

Local Disentanglement in Variational Auto-Encoders Using Jacobian $L_1$ Regularization

Arxiv

0+阅读 · 2021年6月5日

Measuring and Improving Consistency in Pretrained Language Models

Arxiv

0+阅读 · 2021年5月29日

Learning Sparse Sentence Encoding without Supervision: An Exploration of Sparsity in Variational Autoencoders

Arxiv

0+阅读 · 2021年4月16日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

Latent Relation Language Models

Arxiv

21+阅读 · 2019年8月21日

Pre-trained Language Model Representations for Language Generation

Arxiv

5+阅读 · 2019年4月1日

Topic Compositional Neural Language Model

Arxiv

5+阅读 · 2018年2月26日

Discrete Autoencoders for Sequence Models

Arxiv

6+阅读 · 2018年1月29日

VIP会员

文章信息

相关主题

潜变量/隐变量

变分自编码

语言模型化

相关VIP内容

ICML 2021论文收录

ICML 2021论文收录

专知会员服务

123+阅读 · 2021年5月8日

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

不可错过！MILA最新《自监督表示学习》课程，附PPT与视频下载

专知会员服务

90+阅读 · 2020年12月21日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

纽约大学最新《语音识别Speech Recognition》2020课程，不可错过！

专知会员服务

44+阅读 · 2020年11月2日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【斯坦福】机器学习优化简明导论， Introduction to Optimization for Machine Learning

【斯坦福】机器学习优化简明导论， Introduction to Optimization for Machine Learning

专知会员服务

93+阅读 · 2020年5月6日

【纽约大学】产生新的概念与混合神经符号模型，Generating new concepts with hybrid neuro-symbolic models

【纽约大学】产生新的概念与混合神经符号模型，Generating new concepts with hybrid neuro-symbolic models

专知会员服务

17+阅读 · 2020年3月23日

【AAAI Tutorials 2019】深度贝叶斯与序列学习（ Deep Bayesian and Sequential Learning）

【AAAI Tutorials 2019】深度贝叶斯与序列学习（ Deep Bayesian and Sequential Learning）

专知会员服务

72+阅读 · 2019年11月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《物联网（IoT）中的无人机通信高效控制》135页

《在GNSS信号降级环境中利用共识实现无人机集群稳健协调》

中程单向攻击无人机的战略意义：俄乌战争启示

《面向无人机集群的避障动态传感器覆盖算法》最新38页

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

强化学习族谱

强化学习族谱

CreateAMind

26+阅读 · 2017年8月2日

深度神经网络生成模型：从 GAN VAE 到 CVAE-GAN

深度神经网络生成模型：从 GAN VAE 到 CVAE-GAN

AI100

11+阅读 · 2017年7月20日

相关论文

Model Selection for Bayesian Autoencoders

Arxiv

0+阅读 · 2021年6月11日

Score-based Generative Modeling in Latent Space

Arxiv

0+阅读 · 2021年6月10日

Local Disentanglement in Variational Auto-Encoders Using Jacobian $L_1$ Regularization

Arxiv

0+阅读 · 2021年6月5日

Measuring and Improving Consistency in Pretrained Language Models

Arxiv

0+阅读 · 2021年5月29日

Learning Sparse Sentence Encoding without Supervision: An Exploration of Sparsity in Variational Autoencoders

Arxiv

0+阅读 · 2021年4月16日

UniLMv2: Pseudo-Masked Language Models for Unified Language Model Pre-Training

Arxiv

15+阅读 · 2020年2月28日

Latent Relation Language Models

Arxiv

21+阅读 · 2019年8月21日

Pre-trained Language Model Representations for Language Generation

Arxiv

5+阅读 · 2019年4月1日

Topic Compositional Neural Language Model

Arxiv

5+阅读 · 2018年2月26日

Discrete Autoencoders for Sequence Models

Arxiv

6+阅读 · 2018年1月29日

微信扫码咨询专知VIP会员