Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, have been shown to significantly improve language modeling while also providing a natural source attribution mechanism. Existing RALM approaches focus on modifying the LM architecture to facilitate the incorporation of external information, which significantly complicates deployment. This paper proposes an under-explored alternative, which we dub In-Context RALM: leaving the LM architecture unchanged and prepending grounding documents to the input. We show that in-context RALM built on off-the-shelf general-purpose retrievers provides surprisingly large LM gains across model sizes and diverse corpora. We also demonstrate that the document retrieval and ranking mechanism can be specialized to the RALM setting to further boost performance. We conclude that in-context RALM has considerable potential to increase the prevalence of LM grounding, particularly in settings where a pretrained LM must be used without modification, or even accessed only via an API. To that end, we make our code publicly available.
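To make the prepending mechanism concrete, here is a minimal sketch, assuming a Hugging Face GPT-2 model and a toy word-overlap retriever over a small in-memory corpus; `CORPUS`, `retrieve`, and `generate_with_ralm` are illustrative names, not the paper's implementation, and any off-the-shelf retriever (e.g., BM25) could stand in for the toy scorer:

```python
# Minimal sketch of In-Context RALM: retrieve a document for the current
# prefix and prepend it to the LM input, leaving the model unchanged.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Toy grounding corpus; in practice this is a large external corpus.
CORPUS = [
    "The Eiffel Tower is located in Paris and was completed in 1889.",
    "Python is a programming language created by Guido van Rossum.",
]

def retrieve(query: str) -> str:
    """Toy lexical retriever: pick the document with the highest
    word overlap with the query (a stand-in for BM25 or a dense retriever)."""
    q = set(query.lower().split())
    return max(CORPUS, key=lambda d: len(q & set(d.lower().split())))

def generate_with_ralm(prefix: str, max_new_tokens: int = 20) -> str:
    # The core of in-context RALM: prepend the retrieved document to the
    # prefix and run the *unmodified* LM on the concatenation.
    document = retrieve(prefix)
    inputs = tokenizer(document + "\n" + prefix, return_tensors="pt")
    output = model.generate(**inputs, max_new_tokens=max_new_tokens,
                            do_sample=False)
    return tokenizer.decode(output[0], skip_special_tokens=True)

print(generate_with_ralm("The Eiffel Tower was completed in"))
```

Because the grounding document enters only through the input text, the same recipe works for any causal LM, including models reachable solely through an API.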