Linguistic Acceptability is the task of determining whether a sentence is grammatical or ungrammatical. It has applications in several use cases, such as Question Answering, Natural Language Generation, and Neural Machine Translation, where grammatical correctness is crucial. In this paper, we aim to understand the decision-making process of BERT (Devlin et al., 2019) in distinguishing between Linguistically Acceptable sentences (LA) and Linguistically Unacceptable sentences (LUA). We leverage Layer Integrated Gradients attribution scores (LIG) to explain the linguistic acceptability criteria that are learnt by BERT on the Corpus of Linguistic Acceptability (CoLA) (Warstadt et al., 2018) benchmark dataset. Our experiments on 5 categories of sentences lead to the following interesting findings: 1) the LIG for LA are significantly smaller than those for LUA; 2) there are specific subtrees of the Constituency Parse Tree (CPT) for LA and LUA that contribute larger LIG; 3) across the different categories of sentences, around 88% to 100% of the correctly classified sentences had positive LIG, indicating a strong positive relationship with the prediction confidence of the model; and 4) around 43% of the misclassified sentences had negative LIG; we believe these could become correctly classified if the LIG were parameterized in the loss function of the model.
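For context, Layer Integrated Gradients extends the standard Integrated Gradients attribution (Sundararajan et al., 2017) from input features to the activations of an intermediate layer, such as BERT's embedding layer. A sketch of the underlying per-feature attribution, where $x$ is the input, $x'$ a baseline (e.g., all padding-token embeddings), and $F$ the model's scalar output for the predicted class:

$$\mathrm{IG}_i(x) \;=\; (x_i - x'_i) \int_0^1 \frac{\partial F\big(x' + \alpha\,(x - x')\big)}{\partial x_i}\, d\alpha$$

In practice the integral is approximated by a Riemann sum over a small number of interpolation steps $\alpha_k = k/m$, $k = 1, \dots, m$; the sentence-level LIG scores discussed above can then be obtained by aggregating token-level attributions.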