Neural language models (LMs) represent facts about the world described by text. Sometimes these facts derive from training data (in most LMs, a representation of the word banana encodes the fact that bananas are fruits). Sometimes facts derive from input text itself (a representation of the sentence "I poured out the bottle" encodes the fact that the bottle became empty). Tools for inspecting and modifying LM fact representations would be useful almost everywhere LMs are used: making it possible to update them when the world changes, to localize and remove sources of bias, and to identify errors in generated text. We describe REMEDI, an approach for querying and modifying factual knowledge in LMs. REMEDI learns a map from textual queries to fact encodings in an LM's internal representation system. These encodings can be used as knowledge editors: by adding them to LM hidden representations, we can modify downstream generation to be consistent with new facts. REMEDI encodings can also be used as model probes: by comparing them to LM representations, we can ascertain what properties LMs attribute to mentioned entities, and predict when they will generate outputs that conflict with background knowledge or input text. REMEDI thus links work on probing, prompting, and model editing, and offers steps toward general tools for fine-grained inspection and control of knowledge in LMs.
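To make the two uses of fact encodings concrete, here is a minimal sketch (not the authors' released code) of both operations on a small causal LM. Everything specific is an illustrative assumption: the model name `gpt2`, the intervention layer, the entity token position, and the random `edit_vector` standing in for a learned REMEDI fact encoding.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumption: any causal LM exposing per-layer hidden states
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

edit_layer = 6                          # assumption: layer at which to intervene
entity_position = 1                     # assumption: token index of the entity mention
hidden_size = model.config.hidden_size
edit_vector = torch.randn(hidden_size)  # stand-in for a learned fact encoding

def add_edit(module, module_inputs, module_output):
    """Editing use: add the fact encoding to the entity token's hidden state."""
    hidden = module_output[0].clone()
    # Apply only on the full-prompt pass; cached decoding steps have length 1.
    if hidden.size(1) > entity_position:
        hidden[:, entity_position, :] += edit_vector
    return (hidden,) + module_output[1:]

hook = model.transformer.h[edit_layer].register_forward_hook(add_edit)
try:
    prompt = "The Space Needle is located in the city of"
    inputs = tokenizer(prompt, return_tensors="pt")
    out = model.generate(**inputs, max_new_tokens=8, do_sample=False,
                         pad_token_id=tokenizer.eos_token_id)
    print(tokenizer.decode(out[0], skip_special_tokens=True))
finally:
    hook.remove()

# Probing use: score how strongly the LM's own representation of the entity
# agrees with the fact encoding (here via cosine similarity, with the hook
# removed so the representation is unedited).
with torch.no_grad():
    states = model(**inputs, output_hidden_states=True).hidden_states
entity_rep = states[edit_layer + 1][0, entity_position]
score = torch.cosine_similarity(entity_rep, edit_vector, dim=0)
print(f"probe score: {score.item():.3f}")
```

In this sketch the same vector serves both roles: added to a hidden state it steers downstream generation, while compared against a hidden state it yields a scalar that can flag agreement or conflict with the fact it encodes.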