Interpreting Language Models with Contrastive Explanations - Zhuanzhi (专知) Papers


February 21, 2022

Interpreting Language Models with Contrastive Explanations


Kayo Yin, Graham Neubig

Model interpretability methods are often used to explain NLP model decisions on tasks such as text classification, where the output space is relatively small. However, when applied to language generation, where the output space often consists of tens of thousands of tokens, these methods are unable to provide informative explanations. Language models must consider various features to predict a token, such as its part of speech, number, tense, or semantics. Existing explanation methods conflate evidence for all these features into a single explanation, which is less interpretable for human understanding. To disentangle the different decisions in language modeling, we focus on explaining language models contrastively: we look for salient input tokens that explain why the model predicted one token instead of another. We demonstrate that contrastive explanations are quantifiably better than non-contrastive explanations in verifying major grammatical phenomena, and that they significantly improve contrastive model simulatability for human observers. We also identify groups of contrastive decisions where the model uses similar evidence, and we are able to characterize what input tokens models use during various language generation decisions.
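The contrast between the two kinds of explanation can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy model whose logits are a linear function of the mean input embedding, and it assumes gradient-times-input as the saliency measure. The names `plain_saliency`, `contrastive_saliency`, `X`, and `W` are hypothetical. The contrastive score differentiates the difference between the target logit and the foil logit (which equals the difference of their log-probabilities, since the softmax normalizer cancels), so evidence shared by both candidates drops out.

```python
import numpy as np

def plain_saliency(X, W, target):
    """Non-contrastive gradient-times-input saliency per input token.

    Toy model (an assumption for illustration): logits = W @ mean(X).
    X: (n_tokens, d) input embeddings; W: (vocab, d) output weights.
    """
    n = X.shape[0]
    grad = W[target] / n          # d logit_target / d x_i, identical for every i
    return X @ grad               # dot product of each embedding with its gradient

def contrastive_saliency(X, W, target, foil):
    """Gradient-times-input saliency for 'why target instead of foil?'."""
    n = X.shape[0]
    grad = (W[target] - W[foil]) / n   # d (logit_target - logit_foil) / d x_i
    return X @ grad

# Two input tokens. Dimension 0 is evidence both candidates share;
# dimension 1 is the evidence that separates target (row 0) from foil (row 1).
X = np.array([[1.0, 0.0],
              [0.0, 1.0]])
W = np.array([[1.0, 1.0],
              [1.0, -1.0]])

print(plain_saliency(X, W, target=0))                # [0.5 0.5]
print(contrastive_saliency(X, W, target=0, foil=1))  # [0. 1.]
```

In this toy setting the non-contrastive score credits both input tokens equally, while the contrastive score assigns all the credit to the second token, the only one carrying evidence that distinguishes the target from the foil.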


