The great success of Transformer-based models benefits from the powerful multi-head self-attention mechanism, which learns token dependencies and encodes contextual information from the input. Prior work strives to attribute model decisions to individual input features with different saliency measures, but fails to explain how these input features interact with each other to reach predictions. In this paper, we propose a self-attention attribution method to interpret the information interactions inside Transformer. We take BERT as an example to conduct extensive studies. First, we apply self-attention attribution to identify the important attention heads, while the others can be pruned with marginal performance degradation. Furthermore, we extract the most salient dependencies in each layer to construct an attribution tree, which reveals the hierarchical interactions inside Transformer. Finally, we show that the attribution results can be used as adversarial patterns to implement non-targeted attacks against BERT.
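As a rough illustration of the attribution step, the minimal sketch below approximates the integrated gradient of the model output with respect to one layer's attention matrix and multiplies it element-wise with the attention scores. The `get_attention` and `forward_with_attention` hooks are hypothetical stand-ins for however one exposes and overrides BERT's attention internals, and `m` is the number of Riemann approximation steps; this is a sketch under those assumptions, not the paper's implementation.

```python
import torch

def attention_attribution(model, inputs, layer, m=20):
    """Integrated-gradients-style attribution over a layer's attention
    scores: Attr(A) = A * (1/m) * sum_k dF((k/m) * A)/dA.

    `model.get_attention` and `model.forward_with_attention` are
    hypothetical hooks: the first returns the layer's attention matrix A
    (heads x seq x seq), the second re-runs the forward pass with A
    replaced by a scaled copy and returns the predicted-class logit.
    """
    A = model.get_attention(inputs, layer).detach()
    grad_sum = torch.zeros_like(A)
    for k in range(1, m + 1):
        # Gradient of the output logit at the scaled attention matrix
        # (k/m) * A, as in integrated gradients.
        A_scaled = (k / m * A).clone().requires_grad_(True)
        logit = model.forward_with_attention(inputs, layer, A_scaled)
        grad_sum += torch.autograd.grad(logit, A_scaled)[0]
    # Element-wise product of A with the averaged gradients; summing a
    # head's entries then scores that head's importance (e.g., for pruning).
    return A * grad_sum / m
```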