DecompX: Explaining Transformers Decisions by Propagating Token Decomposition - 专知论文

会员服务 ·

0

Analysis · MoDELS · 变换 · 词元分析器 · 向量化 ·

2023 年 6 月 5 日

DecompX: Explaining Transformers Decisions by Propagating Token Decomposition

翻译：暂无翻译

Ali Modarressi,Mohsen Fayyaz,Ehsan Aghazadeh,Yadollah Yaghoobzadeh,Mohammad Taher Pilehvar

from arxiv, Accepted to ACL 2023 (main conference)

An emerging solution for explaining Transformer-based models is to use vector-based analysis on how the representations are formed. However, providing a faithful vector-based explanation for a multi-layer model could be challenging in three aspects: (1) Incorporating all components into the analysis, (2) Aggregating the layer dynamics to determine the information flow and mixture throughout the entire model, and (3) Identifying the connection between the vector-based analysis and the model's predictions. In this paper, we present DecompX to tackle these challenges. DecompX is based on the construction of decomposed token representations and their successive propagation throughout the model without mixing them in between layers. Additionally, our proposal provides multiple advantages over existing solutions for its inclusion of all encoder components (especially nonlinear feed-forward networks) and the classification head. The former allows acquiring precise vectors while the latter transforms the decomposition into meaningful prediction-based values, eliminating the need for norm- or summation-based vector aggregation. According to the standard faithfulness evaluations, DecompX consistently outperforms existing gradient-based and vector-based approaches on various datasets. Our code is available at https://github.com/mohsenfayyaz/DecompX.

翻译：暂无翻译

0

相关内容

Analysis

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

专知会员服务

85+阅读 · 2023年6月19日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

肝细胞癌Gd-EOB-DTPA增强MR成像肝胆期信号的分子病理机制及对恶性程度与预后评估的研究

国家自然科学基金

0+阅读 · 2014年12月31日

控释VEGF/NT-3脊髓脱细胞支架在SCI模型中的血管化及神经再生研究

国家自然科学基金

0+阅读 · 2013年12月31日

柔性拦截网力学行为的等代理论及一致性模型

国家自然科学基金

1+阅读 · 2013年12月31日

外加应力及含水蒸气环境中CoNiCrAlY涂层表面氧化层的生长机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

人类胚胎脊柱及脊髓发育 MRI 与组织学对照研究

国家自然科学基金

0+阅读 · 2012年12月31日

染料聚集体/Au纳米结构中激子-表面等离子体耦合的电化学调制及生物分析应用

国家自然科学基金

0+阅读 · 2012年12月31日

模糊Domain中的一些范畴之间的对偶等价

国家自然科学基金

0+阅读 · 2012年12月31日

Reality-based Interaction用户界面模型和评估方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

近邻星系的中远红外性质

国家自然科学基金

0+阅读 · 2011年12月31日

超声造影微血管显像与乳腺癌血管生成和血管内皮生长因子（VEGF）表达的相关性研究

国家自然科学基金

0+阅读 · 2009年12月31日

Counterfactual Explanations for Graph Classification Through the Lenses of Density

Arxiv

0+阅读 · 2023年7月27日

Explainable Techniques for Analyzing Flow Cytometry Cell Transformers

Arxiv

0+阅读 · 2023年7月27日

MCMC-Correction of Score-Based Diffusion Models for Model Composition

Arxiv

0+阅读 · 2023年7月26日

On the Error-Reducing Properties of Superposition Codes

Arxiv

0+阅读 · 2023年7月25日

Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection

Arxiv

0+阅读 · 2023年7月25日

What Symptoms and How Long? An Interpretable AI Approach for Depression Detection in Social Media

Arxiv

1+阅读 · 2023年7月25日

Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

Arxiv

0+阅读 · 2023年7月24日

Concept-based explainability for an EEG transformer model

Arxiv

0+阅读 · 2023年7月24日

Predicting Ordinary Differential Equations with Transformers

Arxiv

0+阅读 · 2023年7月24日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

VIP会员

文章信息

相关主题

词元分析器

相关VIP内容

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

CVPR 2023开会了！谷歌等最新《视觉上理解和解释注意力》教程，附152页ppt

专知会员服务

85+阅读 · 2023年6月19日

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

最新《Transformers模型》教程，64页ppt

最新《Transformers模型》教程，64页ppt

专知会员服务

325+阅读 · 2020年11月26日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

仿生机器人技术的军事应用

《反集群作战：基于深度学习的分布式决策方法》89页

机器人领域中最佳的三维场景表示是什么？——从几何表示到基础模型

《多域作战兵棋推演：运用形态学分析与人工智能加强国防人员训练》

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【论文】图上的表示学习综述

【论文】图上的表示学习综述

机器学习研究会

15+阅读 · 2017年9月24日

相关论文

Counterfactual Explanations for Graph Classification Through the Lenses of Density

Arxiv

0+阅读 · 2023年7月27日

Explainable Techniques for Analyzing Flow Cytometry Cell Transformers

Arxiv

0+阅读 · 2023年7月27日

MCMC-Correction of Score-Based Diffusion Models for Model Composition

Arxiv

0+阅读 · 2023年7月26日

On the Error-Reducing Properties of Superposition Codes

Arxiv

0+阅读 · 2023年7月25日

Re-mine, Learn and Reason: Exploring the Cross-modal Semantic Correlations for Language-guided HOI detection

Arxiv

0+阅读 · 2023年7月25日

What Symptoms and How Long? An Interpretable AI Approach for Depression Detection in Social Media

Arxiv

1+阅读 · 2023年7月25日

Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection

Arxiv

0+阅读 · 2023年7月24日

Concept-based explainability for an EEG transformer model

Arxiv

0+阅读 · 2023年7月24日

Predicting Ordinary Differential Equations with Transformers

Arxiv

0+阅读 · 2023年7月24日

The Principles of Deep Learning Theory

Arxiv

66+阅读 · 2021年6月18日

相关基金

肝细胞癌Gd-EOB-DTPA增强MR成像肝胆期信号的分子病理机制及对恶性程度与预后评估的研究

国家自然科学基金

0+阅读 · 2014年12月31日

控释VEGF/NT-3脊髓脱细胞支架在SCI模型中的血管化及神经再生研究

国家自然科学基金

0+阅读 · 2013年12月31日

柔性拦截网力学行为的等代理论及一致性模型

国家自然科学基金

1+阅读 · 2013年12月31日

外加应力及含水蒸气环境中CoNiCrAlY涂层表面氧化层的生长机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

人类胚胎脊柱及脊髓发育 MRI 与组织学对照研究

国家自然科学基金

0+阅读 · 2012年12月31日

染料聚集体/Au纳米结构中激子-表面等离子体耦合的电化学调制及生物分析应用

国家自然科学基金

0+阅读 · 2012年12月31日

模糊Domain中的一些范畴之间的对偶等价

国家自然科学基金

0+阅读 · 2012年12月31日

Reality-based Interaction用户界面模型和评估方法研究

国家自然科学基金

0+阅读 · 2011年12月31日

近邻星系的中远红外性质

国家自然科学基金

0+阅读 · 2011年12月31日

超声造影微血管显像与乳腺癌血管生成和血管内皮生长因子（VEGF）表达的相关性研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员