Much of the recent progress in NLU has been shown to stem from models learning dataset-specific heuristics. We conduct a case study of generalization in NLI (from MNLI to the adversarially constructed HANS dataset) across a range of BERT-based architectures (adapters, Siamese Transformers, HEX debiasing), as well as with subsampling the training data and increasing the model size. We report two successful and three unsuccessful strategies, all providing insights into how Transformer-based models learn to generalize.