We analyze the storage and recall of factual associations in autoregressive transformer language models, finding evidence that these associations correspond to localized, directly-editable computations. We first develop a causal intervention for identifying neuron activations that are decisive in a model's factual predictions. This reveals a distinct set of steps in middle-layer feed-forward modules that mediate factual predictions while processing subject tokens. To test our hypothesis that these computations correspond to factual association recall, we modify feed-forward weights to update specific factual associations using Rank-One Model Editing (ROME). We find that ROME is effective on a standard zero-shot relation extraction (zsRE) model-editing task, comparable to existing methods. To perform a more sensitive evaluation, we also evaluate ROME on a new dataset of counterfactual assertions, on which it simultaneously maintains both specificity and generalization, whereas other methods sacrifice one or another. Our results confirm an important role for mid-layer feed-forward modules in storing factual associations and suggest that direct manipulation of computational mechanisms may be a feasible approach for model editing. The code, dataset, visualizations, and an interactive demo notebook are available at https://rome.baulab.info/
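To make the rank-one edit concrete, the sketch below writes a single key-value association (k*, v*) into a feed-forward projection matrix. This is a minimal illustration, not the paper's implementation: the full ROME update weights the key by an inverse second-moment matrix C⁻¹ estimated from corpus activations, which is simplified here to the identity, and the helper name `rank_one_edit` is hypothetical.

```python
import torch

def rank_one_edit(W: torch.Tensor, k_star: torch.Tensor, v_star: torch.Tensor) -> torch.Tensor:
    """Insert the association k* -> v* into a linear map W via a rank-one update.

    Simplified sketch: ROME's closed-form update uses C^{-1} k* with
    C estimated from text activations; taking C = I reduces it to
        W' = W + (v* - W k*) k*^T / (k*^T k*).
    """
    residual = v_star - W @ k_star   # the output W currently fails to produce for k*
    scale = k_star @ k_star          # plays the role of k*^T C^{-1} k* with C = I
    return W + torch.outer(residual, k_star) / scale

# Toy usage: a random stand-in for an MLP projection, then write one association.
torch.manual_seed(0)
W = torch.randn(8, 4)
k_star = torch.randn(4)   # key: subject representation entering the module
v_star = torch.randn(8)   # value: desired module output encoding the new fact
W_new = rank_one_edit(W, k_star, v_star)
assert torch.allclose(W_new @ k_star, v_star, atol=1e-5)  # edited fact is recalled
```

By construction, W' k* = W k* + (v* − W k*) = v*, so the edited matrix returns the new value exactly for the target key while perturbing the map only along a single rank-one direction.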