【EMNLP 2019 最佳论文】信息瓶颈专门化单词嵌入（用于解析）（Specializing Word Embeddings（for Parsing）by Information Bottleneck） - 专知VIP

会员服务 ·

2

深度学习 · EMNLP · Xiang Lisa Li · 词向量表示 · 约翰斯·霍普金斯大学 ·

2019 年 11 月 20 日

【EMNLP 2019 最佳论文】信息瓶颈专门化单词嵌入（用于解析）（Specializing Word Embeddings（for Parsing）by Information Bottleneck）

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

论文题目： Specializing Word Embeddings（for Parsing）by Information Bottleneck

论文摘要： 预训练词向量，如 ELMo 和 BERT 包括了丰富的句法和语义信息，使这些模型能够在各种任务上达到 SOTA 表现。在本文中，研究者则提出了一个非常快速的变分信息瓶颈方法，能够用非线性的方式压缩这些嵌入，仅保留能够帮助句法解析器的信息。研究者将每个词嵌入压缩成一个离散标签，或者一个连续向量。在离散的模式下，压缩的离散标签可以组成一种替代标签集。通过实验可以说明，这种标签集能够捕捉大部分传统 POS 标签标注的信息，而且这种标签序列在语法解析的过程中更为精确（在标签质量相似的情况下）。而在连续模式中，研究者通过实验说明，适当地压缩词嵌入可以在 8 种语言中产生更精确的语法解析器。这比简单的降维方法要好。

作者简介：

Xiang Lisa Li，约翰斯·霍普金斯大学的大四学生，其导师是著名NLP学者Jason Eisner，研究结构化预测和语法。

Jason Eisner，约翰斯·霍普金斯大学计算机科学系教授，ACL研究员。

成为VIP会员查看完整内容

Specializing Word Embeddings（for Parsing）by Information Bottleneck.pdf

24

相关内容

深度学习

机器学习的一个分支，它基于试图使用包含复杂结构或由多重非线性变换构成的多个处理层对数据进行高层抽象的一系列算法。

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

专知会员服务

61+阅读 · 2020年5月15日

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

专知会员服务

20+阅读 · 2020年4月14日

NLP基础任务:文本分类近年发展汇总,68页超详细解析

NLP基础任务:文本分类近年发展汇总,68页超详细解析

专知会员服务

58+阅读 · 2020年1月3日

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

专知会员服务

66+阅读 · 2019年12月20日

【斯坦福课程：从语言到信息】《CS 124: From Languages to Information (Winter 2020)》by Dan Jurafsky

【斯坦福课程：从语言到信息】《CS 124: From Languages to Information (Winter 2020)》by Dan Jurafsky

专知会员服务

17+阅读 · 2019年12月2日

【AAAI 2019 Tutorial】超越单词的神经向量表示:句子和文档嵌入（Neural Vector Representations beyond Words: Sentence and Document Embeddings），Gerard de Melo

【AAAI 2019 Tutorial】超越单词的神经向量表示:句子和文档嵌入（Neural Vector Representations beyond Words: Sentence and Document Embeddings），Gerard de Melo

专知会员服务

19+阅读 · 2019年11月18日

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

专知会员服务

48+阅读 · 2019年11月17日

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

专知会员服务

11+阅读 · 2019年11月16日

【多语言模型跨任务嵌入投影】Multilingual model using cross-task embedding projection，CoNLL 2019，附论文和PPT免费下载

【多语言模型跨任务嵌入投影】Multilingual model using cross-task embedding projection，CoNLL 2019，附论文和PPT免费下载

专知会员服务

10+阅读 · 2019年11月4日

六篇 EMNLP 2019【图神经网络(GNN)+NLP】相关论文

六篇 EMNLP 2019【图神经网络(GNN)+NLP】相关论文

专知会员服务

72+阅读 · 2019年11月3日

阿尔伯塔大学博士毕业论文：基于图结构的自然语言处理

阿尔伯塔大学博士毕业论文：基于图结构的自然语言处理

机器之心

15+阅读 · 2020年3月25日

论文浅尝 | 基于微量资源的神经网络跨语言命名实体识别

论文浅尝 | 基于微量资源的神经网络跨语言命名实体识别

开放知识图谱

6+阅读 · 2019年8月19日

哈工大SCIR两篇论文被IJCAI 2019录用

哈工大SCIR两篇论文被IJCAI 2019录用

哈工大SCIR

7+阅读 · 2019年5月11日

R语言自然语言处理：文本向量化——词嵌入（Word Embedding）

R语言自然语言处理：文本向量化——词嵌入（Word Embedding）

R语言中文社区

10+阅读 · 2019年4月6日

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

机器之心

15+阅读 · 2019年3月18日

情感分析词嵌入预处理细粒度实验综述（附20页全文下载）

情感分析词嵌入预处理细粒度实验综述（附20页全文下载）

专知

18+阅读 · 2019年2月5日

CoNLL 2018 | 最佳论文揭晓：词嵌入获得的信息远比我们想象中的要多得多

CoNLL 2018 | 最佳论文揭晓：词嵌入获得的信息远比我们想象中的要多得多

黑龙江大学自然语言处理实验室

3+阅读 · 2018年11月2日

赛尔原创 | COLING 2018 中文零指代消解：基于注意力机制的模型

赛尔原创 | COLING 2018 中文零指代消解：基于注意力机制的模型

哈工大SCIR

8+阅读 · 2018年7月23日

NAACL 2018 | 最佳论文：艾伦人工智能研究所提出新型深度语境化词表征

NAACL 2018 | 最佳论文：艾伦人工智能研究所提出新型深度语境化词表征

机器之心

5+阅读 · 2018年6月7日

深度 | 当前最好的词句嵌入技术概览：从无监督学习转向监督、多任务学习

深度 | 当前最好的词句嵌入技术概览：从无监督学习转向监督、多任务学习

机器之心

3+阅读 · 2018年6月6日

Learning Conceptual-Contexual Embeddings for Medical Text

Arxiv

27+阅读 · 2019年8月16日

Language Modeling with Deep Transformers

Arxiv

6+阅读 · 2019年7月11日

Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation

Arxiv

3+阅读 · 2019年6月24日

Multi-Task Deep Neural Networks for Natural Language Understanding

Multi-Task Deep Neural Networks for Natural Language Understanding

Arxiv

3+阅读 · 2019年1月31日

Multi-task learning to improve natural language understanding

Arxiv

4+阅读 · 2018年12月17日

Graph Convolutional Networks for Text Classification

Arxiv

11+阅读 · 2018年10月17日

Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

Arxiv

5+阅读 · 2018年8月6日

Invariant Information Distillation for Unsupervised Image Segmentation and Clustering

Invariant Information Distillation for Unsupervised Image Segmentation and Clustering

Arxiv

5+阅读 · 2018年7月21日

Hybrid semi-Markov CRF for Neural Sequence Labeling

Arxiv

5+阅读 · 2018年5月10日

Attention Focusing for Neural Machine Translation by Bridging Source and Target Embeddings

Arxiv

5+阅读 · 2018年5月10日

VIP会员

相关主题

词向量表示

约翰斯·霍普金斯大学

相关VIP内容

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

【ACL2020】命名实体识别即依存解析，Named Entity Recognition as Dependency Parsing

专知会员服务

61+阅读 · 2020年5月15日

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

【ACL2020-伯克利】预训练Transformer提高分布外鲁棒性

专知会员服务

20+阅读 · 2020年4月14日

NLP基础任务:文本分类近年发展汇总,68页超详细解析

NLP基础任务:文本分类近年发展汇总,68页超详细解析

专知会员服务

58+阅读 · 2020年1月3日

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

论深度学习的信息瓶颈理论（On the information bottleneck theory of deep learning）

专知会员服务

66+阅读 · 2019年12月20日

【斯坦福课程：从语言到信息】《CS 124: From Languages to Information (Winter 2020)》by Dan Jurafsky

【斯坦福课程：从语言到信息】《CS 124: From Languages to Information (Winter 2020)》by Dan Jurafsky

专知会员服务

17+阅读 · 2019年12月2日

【AAAI 2019 Tutorial】超越单词的神经向量表示:句子和文档嵌入（Neural Vector Representations beyond Words: Sentence and Document Embeddings），Gerard de Melo

【AAAI 2019 Tutorial】超越单词的神经向量表示:句子和文档嵌入（Neural Vector Representations beyond Words: Sentence and Document Embeddings），Gerard de Melo

专知会员服务

19+阅读 · 2019年11月18日

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

【ACL 2019 Tutorials】深度贝叶斯自然语言处理（Deep Bayesian Natural Language Processing），Jen-Tzung Chien

专知会员服务

48+阅读 · 2019年11月17日

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

【ACL 2019 Tutorials】话语分析及其应用（Discourse Analysis and Its Applications），Shafiq Joty，Giuseppe Carenini，Raymond Ng，Gabriel Murray

专知会员服务

11+阅读 · 2019年11月16日

【多语言模型跨任务嵌入投影】Multilingual model using cross-task embedding projection，CoNLL 2019，附论文和PPT免费下载

【多语言模型跨任务嵌入投影】Multilingual model using cross-task embedding projection，CoNLL 2019，附论文和PPT免费下载

专知会员服务

10+阅读 · 2019年11月4日

六篇 EMNLP 2019【图神经网络(GNN)+NLP】相关论文

六篇 EMNLP 2019【图神经网络(GNN)+NLP】相关论文

专知会员服务

72+阅读 · 2019年11月3日

热门VIP内容

开通专知VIP会员享更多权益服务

新质生成式AI赋能产业变革的实践与路径

用于多模态大模型的离散标记化：全面综述

Nature综述：金融网络中的物理学

【CMU博士论文】通信高效且差分隐私的优化方法

相关资讯

阿尔伯塔大学博士毕业论文：基于图结构的自然语言处理

阿尔伯塔大学博士毕业论文：基于图结构的自然语言处理

机器之心

15+阅读 · 2020年3月25日

论文浅尝 | 基于微量资源的神经网络跨语言命名实体识别

论文浅尝 | 基于微量资源的神经网络跨语言命名实体识别

开放知识图谱

6+阅读 · 2019年8月19日

哈工大SCIR两篇论文被IJCAI 2019录用

哈工大SCIR两篇论文被IJCAI 2019录用

哈工大SCIR

7+阅读 · 2019年5月11日

R语言自然语言处理：文本向量化——词嵌入（Word Embedding）

R语言自然语言处理：文本向量化——词嵌入（Word Embedding）

R语言中文社区

10+阅读 · 2019年4月6日

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

BAM！利用知识蒸馏和多任务学习构建的通用语言模型

机器之心

15+阅读 · 2019年3月18日

情感分析词嵌入预处理细粒度实验综述（附20页全文下载）

情感分析词嵌入预处理细粒度实验综述（附20页全文下载）

专知

18+阅读 · 2019年2月5日

CoNLL 2018 | 最佳论文揭晓：词嵌入获得的信息远比我们想象中的要多得多

CoNLL 2018 | 最佳论文揭晓：词嵌入获得的信息远比我们想象中的要多得多

黑龙江大学自然语言处理实验室

3+阅读 · 2018年11月2日

赛尔原创 | COLING 2018 中文零指代消解：基于注意力机制的模型

赛尔原创 | COLING 2018 中文零指代消解：基于注意力机制的模型

哈工大SCIR

8+阅读 · 2018年7月23日

NAACL 2018 | 最佳论文：艾伦人工智能研究所提出新型深度语境化词表征

NAACL 2018 | 最佳论文：艾伦人工智能研究所提出新型深度语境化词表征

机器之心

5+阅读 · 2018年6月7日

深度 | 当前最好的词句嵌入技术概览：从无监督学习转向监督、多任务学习

深度 | 当前最好的词句嵌入技术概览：从无监督学习转向监督、多任务学习

机器之心

3+阅读 · 2018年6月6日

相关论文

Learning Conceptual-Contexual Embeddings for Medical Text

Arxiv

27+阅读 · 2019年8月16日

Language Modeling with Deep Transformers

Arxiv

6+阅读 · 2019年7月11日

Language Modelling Makes Sense: Propagating Representations through WordNet for Full-Coverage Word Sense Disambiguation

Arxiv

3+阅读 · 2019年6月24日

Multi-Task Deep Neural Networks for Natural Language Understanding

Multi-Task Deep Neural Networks for Natural Language Understanding

Arxiv

3+阅读 · 2019年1月31日

Multi-task learning to improve natural language understanding

Arxiv

4+阅读 · 2018年12月17日

Graph Convolutional Networks for Text Classification

Arxiv

11+阅读 · 2018年10月17日

Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

Seq2Seq2Sentiment: Multimodal Sequence to Sequence Models for Sentiment Analysis

Arxiv

5+阅读 · 2018年8月6日

Invariant Information Distillation for Unsupervised Image Segmentation and Clustering

Invariant Information Distillation for Unsupervised Image Segmentation and Clustering

Arxiv

5+阅读 · 2018年7月21日

Hybrid semi-Markov CRF for Neural Sequence Labeling

Arxiv

5+阅读 · 2018年5月10日

Attention Focusing for Neural Machine Translation by Bridging Source and Target Embeddings

Arxiv

5+阅读 · 2018年5月10日

微信扫码咨询专知VIP会员