通过迭接调整研究文字顺序 (Studying word order through iterative shuffling) - 专知论文

会员服务 ·

0

语言模型化 · Performer · MoDELS · 神经语言模型 · 推断 ·

2021 年 9 月 10 日

Studying word order through iterative shuffling

翻译：通过迭接调整研究文字顺序

Nikolay Malkin,Sameera Lanka,Pranav Goel,Nebojsa Jojic

from arxiv, EMNLP 2021

As neural language models approach human performance on NLP benchmark tasks, their advances are widely seen as evidence of an increasingly complex understanding of syntax. This view rests upon a hypothesis that has not yet been empirically tested: that word order encodes meaning essential to performing these tasks. We refute this hypothesis in many cases: in the GLUE suite and in various genres of English text, the words in a sentence or phrase can rarely be permuted to form a phrase carrying substantially different information. Our surprising result relies on inference by iterative shuffling (IBIS), a novel, efficient procedure that finds the ordering of a bag of words having the highest likelihood under a fixed language model. IBIS can use any black-box model without additional training and is superior to existing word ordering algorithms. Coalescing our findings, we discuss how shuffling inference procedures such as IBIS can benefit language modeling and constrained generation.

翻译：随着神经语言模型接近人类在NLP基准任务方面的表现,其进步被广泛视为越来越复杂的对语法理解的证据。这种观点基于一个尚未经过经验检验的假设:单词顺序编码对执行这些任务至关重要。我们在许多情况下反驳了这一假设:在GLUE套件和英文文本的各种版本中,一句话或短语中的单词很少能够被改写成含有完全不同的信息的短语。我们的惊人结果依赖于迭代拼接(ISIS)的推论,这是一个新颖而有效的程序,在固定语言模式下找到最有可能的一包单词的顺序。 IBIS可以使用任何黑箱模式,而无需额外的培训,并且优于现有的命令算法。我们的研究,我们讨论了像IBIS这样的拼写程序如何使语言建模和受限制的一代受益。

0

相关内容

语言模型化

语言模型化

2021年中国企业数字转型指数

专知会员服务

13+阅读 · 2021年9月30日

【CMU】可扩展人工智能白皮书

专知会员服务

28+阅读 · 2021年7月3日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【COLING2020】无监督依存解析的综述论文，12页pdf

【COLING2020】无监督依存解析的综述论文，12页pdf

专知会员服务

16+阅读 · 2020年10月27日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【ICLR2020 预训练的百科全书】弱监督的知识-预训练的语言模型（PRETRAINED ENCYCLOPEDIA: WEAKLY SUPERVISED KNOWLEDGE-PRETRAINED LANGUAGE MODEL）

【ICLR2020 预训练的百科全书】弱监督的知识-预训练的语言模型（PRETRAINED ENCYCLOPEDIA: WEAKLY SUPERVISED KNOWLEDGE-PRETRAINED LANGUAGE MODEL）

专知会员服务

25+阅读 · 2019年12月26日

【NUS】神经问题生成的最近进展（Recent Advances in Neural Question Generation）

【NUS】神经问题生成的最近进展（Recent Advances in Neural Question Generation）

专知会员服务

16+阅读 · 2019年12月22日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

专知

29+阅读 · 2018年3月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

Comparative Study of Long Document Classification

Arxiv

0+阅读 · 2021年11月1日

Achieving Model Robustness through Discrete Adversarial Training

Arxiv

0+阅读 · 2021年10月31日

Conical Classification For Computationally Efficient One-Class Topic Determination

Arxiv

0+阅读 · 2021年10月31日

Cause-effect inference through spectral independence in linear dynamical systems: theoretical foundations

Arxiv

0+阅读 · 2021年10月29日

Contrastive Active Inference

Arxiv

4+阅读 · 2021年10月19日

A Survey on Data Augmentation for Text Classification

A Survey on Data Augmentation for Text Classification

Arxiv

16+阅读 · 2021年7月7日

Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks

Arxiv

11+阅读 · 2019年9月8日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Cloze-driven Pretraining of Self-attention Networks

Arxiv

6+阅读 · 2019年3月19日

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Arxiv

3+阅读 · 2019年2月11日

VIP会员

文章信息

相关主题

语言模型化

神经语言模型

相关VIP内容

2021年中国企业数字转型指数

专知会员服务

13+阅读 · 2021年9月30日

【CMU】可扩展人工智能白皮书

专知会员服务

28+阅读 · 2021年7月3日

最新《自监督表示学习》报告，70页ppt

最新《自监督表示学习》报告，70页ppt

专知会员服务

86+阅读 · 2020年12月22日

【COLING2020】无监督依存解析的综述论文，12页pdf

【COLING2020】无监督依存解析的综述论文，12页pdf

专知会员服务

16+阅读 · 2020年10月27日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【ICLR2020 预训练的百科全书】弱监督的知识-预训练的语言模型（PRETRAINED ENCYCLOPEDIA: WEAKLY SUPERVISED KNOWLEDGE-PRETRAINED LANGUAGE MODEL）

【ICLR2020 预训练的百科全书】弱监督的知识-预训练的语言模型（PRETRAINED ENCYCLOPEDIA: WEAKLY SUPERVISED KNOWLEDGE-PRETRAINED LANGUAGE MODEL）

专知会员服务

25+阅读 · 2019年12月26日

【NUS】神经问题生成的最近进展（Recent Advances in Neural Question Generation）

【NUS】神经问题生成的最近进展（Recent Advances in Neural Question Generation）

专知会员服务

16+阅读 · 2019年12月22日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

数据要素发展报告(2025年)：附下载

人工智能代理提升战时舰船战备水平

【NeurIPS2025教程】大语言模型规划

NeurIPS 2025 教程：深度学习训练不稳定性的理论洞见

相关资讯

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

ICLR2019最佳论文出炉

ICLR2019最佳论文出炉

专知

12+阅读 · 2019年5月6日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

【论文推荐】最新七篇自注意力机制(Self-attention)相关论文—结构化自注意力、相对位置、混合、句子表达、文本向量

专知

29+阅读 · 2018年3月12日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

Auto-Encoding GAN

Auto-Encoding GAN

CreateAMind

7+阅读 · 2017年8月4日

相关论文

Comparative Study of Long Document Classification

Arxiv

0+阅读 · 2021年11月1日

Achieving Model Robustness through Discrete Adversarial Training

Arxiv

0+阅读 · 2021年10月31日

Conical Classification For Computationally Efficient One-Class Topic Determination

Arxiv

0+阅读 · 2021年10月31日

Cause-effect inference through spectral independence in linear dynamical systems: theoretical foundations

Arxiv

0+阅读 · 2021年10月29日

Contrastive Active Inference

Arxiv

4+阅读 · 2021年10月19日

A Survey on Data Augmentation for Text Classification

A Survey on Data Augmentation for Text Classification

Arxiv

16+阅读 · 2021年7月7日

Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks

Arxiv

11+阅读 · 2019年9月8日

Language Models as Knowledge Bases?

Arxiv

6+阅读 · 2019年9月4日

Cloze-driven Pretraining of Self-attention Networks

Arxiv

6+阅读 · 2019年3月19日

BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model

Arxiv

3+阅读 · 2019年2月11日

微信扫码咨询专知VIP会员