Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning - Zhuanzhi Papers

Language Modeling · Generalization Theory · MoDELS · BART · Scenario

October 19, 2021

Exploring Generalization Ability of Pretrained Language Models on Arithmetic and Logical Reasoning

Cunxiang Wang, Boyuan Zheng, Yuchen Niu, Yue Zhang

From arXiv; accepted by NLPCC 2021

To explore the generalization ability of pre-trained language models (PLMs) both quantitatively and intuitively, we design several arithmetic and logical reasoning tasks. We analyse how well PLMs generalize both when the test data follows the same distribution as the training data and when it does not; for the latter analysis, we design a cross-distribution test set in addition to the in-distribution test set. We conduct experiments on BART, one of the most advanced publicly released generative PLMs. We find that PLMs generalize easily when the distribution is the same, but still struggle to generalize out of distribution.
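The distinction between an in-distribution and a cross-distribution test set can be illustrated with a small sketch. The abstract does not specify the tasks, operand ranges, or data format, so everything below is an assumption made for illustration: the addition task, the ranges `[0, 100)` and `[1000, 10000)`, the text format `"a + b"`, and the helper names `make_addition_examples`, `split_in_distribution`, and `exact_match_accuracy` are all hypothetical and are not the authors' setup.

```python
import random


def make_addition_examples(low, high, n, seed):
    """Sample n distinct addition problems with both operands in [low, high).

    Hypothetical helper: the paper's real task format is not given in the abstract.
    """
    rng = random.Random(seed)
    pairs = set()
    while len(pairs) < n:
        pairs.add((rng.randrange(low, high), rng.randrange(low, high)))
    # Sort for a reproducible order, then render as (source text, target text).
    return [(f"{a} + {b}", str(a + b)) for a, b in sorted(pairs)]


def split_in_distribution(examples, test_fraction, seed):
    """Shuffle one pool and split it, so train and test share a distribution."""
    rng = random.Random(seed)
    shuffled = list(examples)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


def exact_match_accuracy(predict, examples):
    """Fraction of examples where predict(source) equals the target string."""
    correct = sum(1 for source, target in examples if predict(source) == target)
    return correct / len(examples)


# Assumed ranges: small operands for training and in-distribution testing,
# larger operands (never seen in training) for cross-distribution testing.
pool = make_addition_examples(0, 100, 1000, seed=0)
train, in_dist_test = split_in_distribution(pool, 0.2, seed=1)
cross_dist_test = make_addition_examples(1000, 10000, 200, seed=2)
```

A fine-tuned sequence-to-sequence model wrapped as a `predict(source) -> str` callable could then be scored on both `in_dist_test` and `cross_dist_test` with `exact_match_accuracy`; a gap between the two scores is the kind of out-of-distribution difficulty the abstract reports.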

Related Content

Language Modeling

Knowledge-Enhanced Pretrained Language Models: A Comprehensive Survey

Zhuanzhi Member Service

93+ reads · October 19, 2021

[Causal Foundations] Causality Basics, 36-page PPT

Zhuanzhi Member Service

52+ reads · August 8, 2021

[Cell] Neural Algorithmic Reasoning

Zhuanzhi Member Service

29+ reads · July 16, 2021

[EMNLP2020] Neural Language Generation

Zhuanzhi Member Service

39+ reads · November 20, 2020

[CIKM2020] Neural Logic Reasoning

Zhuanzhi Member Service

51+ reads · August 25, 2020

Introduction to Linux, 96-page PPT

Zhuanzhi Member Service

80+ reads · July 26, 2020

Attention in Natural Language Processing

Zhuanzhi Member Service

136+ reads · May 30, 2020

New! Yann LeCun's NYU Spring 2020 Deep Learning Course, with PPT Download

Zhuanzhi Member Service

47+ reads · January 28, 2020

[Cross-Lingual and Cross-Domain Transfer of NLP Models] Transferring NLP models across languages and domains

Zhuanzhi Member Service

43+ reads · November 25, 2019

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Zhuanzhi Member Service

59+ reads · October 17, 2019

Transferring Knowledge across Learning Processes

CreateAMind

29+ reads · May 18, 2019

Computer Science | 5 Special Issue Announcements from CCF-Recommended Journals

Call4Papers

3+ reads · April 10, 2019

Unsupervised Meta-Learning for Reinforcement Learning

CreateAMind

18+ reads · January 7, 2019

Unsupervised Meta-Learning for Representation Learning

CreateAMind

27+ reads · January 4, 2019

Unsupervised Learning via Meta-Learning

CreateAMind

43+ reads · January 3, 2019

Meta Learning in 2017: MAML, SNAIL

CreateAMind

11+ reads · January 2, 2019

A Discussion of the Disentangled Hypothesis

CreateAMind

9+ reads · December 10, 2018

Hierarchical Disentangled Representations

CreateAMind

4+ reads · April 15, 2018

[Recommended] A Guide to Natural Language Processing (NLP)

Machine Learning Research Society

35+ reads · November 17, 2017

Auto-Encoding GAN

CreateAMind

7+ reads · August 4, 2017

Decomposing Natural Logic Inferences in Neural NLI

arXiv

0+ reads · December 15, 2021

Improving Coherence and Consistency in Neural Sequence Models with Dual-System, Neuro-Symbolic Reasoning

arXiv

0+ reads · December 15, 2021

Cross-Domain Generalization and Knowledge Transfer in Transformers Trained on Legal Data

arXiv

0+ reads · December 15, 2021

Exploring Neural Models for Query-Focused Summarization

arXiv

0+ reads · December 14, 2021

MPLR: a novel model for multi-target learning of logical rules for knowledge graph reasoning

arXiv

1+ reads · December 12, 2021

Probing Linguistic Information For Logical Inference In Pre-trained Language Models

arXiv

5+ reads · December 3, 2021

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

arXiv

4+ reads · February 22, 2021

Natural Language Inference in Context -- Investigating Contextual Reasoning over Long Texts

arXiv

6+ reads · November 10, 2020

Pretrained Transformers for Text Ranking: BERT and Beyond

arXiv

28+ reads · October 13, 2020

Neural Arithmetic Logic Units

arXiv

5+ reads · August 1, 2018
