与检索增强语言模式学习 (Few-shot Learning with Retrieval Augmented Language Models) - 专知论文

会员服务 ·

0

语言模型化 · 知识 (knowledge) · 小样本学习 · Learning · MoDELS ·

2022 年 8 月 8 日

Few-shot Learning with Retrieval Augmented Language Models

翻译：与检索增强语言模式学习

Gautier Izacard,Patrick Lewis,Maria Lomeli,Lucas Hosseini,Fabio Petroni,Timo Schick,Jane Dwivedi-Yu,Armand Joulin,Sebastian Riedel,Edouard Grave

Large language models have shown impressive few-shot results on a wide range of tasks. However, when knowledge is key for such results, as is the case for tasks such as question answering and fact checking, massive parameter counts to store knowledge seem to be needed. Retrieval augmented models are known to excel at knowledge intensive tasks without the need for as many parameters, but it is unclear whether they work in few-shot settings. In this work we present Atlas, a carefully designed and pre-trained retrieval augmented language model able to learn knowledge intensive tasks with very few training examples. We perform evaluations on a wide range of tasks, including MMLU, KILT and NaturalQuestions, and study the impact of the content of the document index, showing that it can easily be updated. Notably, Atlas reaches over 42% accuracy on Natural Questions using only 64 examples, outperforming a 540B parameters model by 3% despite having 50x fewer parameters.

翻译：大型语言模型在一系列广泛的任务中显示出了令人印象深刻的微小结果。然而,当知识是这种结果的关键时,例如问题回答和事实调查等任务,似乎需要大量的参数来储存知识。检索扩展模型被认为在知识密集型任务方面非常出色,不需要同样多的参数,但尚不清楚它们是否在几眼环境中发挥作用。在这项工作中,我们介绍Atlas系统是一个经过仔细设计和预先训练的检索增强语言模型,能够以极少的培训实例来学习知识密集型任务。我们对包括MMMLU、KILT和自然问题在内的广泛任务进行评估,并研究文件索引内容的影响,表明它很容易更新。值得注意的是,Atlas系统仅使用64个实例,在自然问题上达到42%的精确度,比540B参数模型高出3%,尽管参数减少了50倍。

0

相关内容

语言模型化

语言模型化

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

金属氧化物/金刚石异质结制备及其性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

磁性碳纳米管的原位制备机理及性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

利用离子阱颗粒质谱对微球表面吸附的定量表征

国家自然科学基金

0+阅读 · 2013年12月31日

Fe、Co、Ni超细纳米结构制备与催化放氢研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于微流控芯片的细胞间药物转运与化学信号传递研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于LOC的微米级油液颗粒污染物区分检测机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

可吸入矿物细颗粒与常见菌的近尺寸作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

PCOS患者卵巢颗粒细胞对卵子及早期胚胎发育潜能的基因调控

国家自然科学基金

0+阅读 · 2011年12月31日

添加新型富钡相RE242制备高超导性能REBCO块材

国家自然科学基金

0+阅读 · 2011年12月31日

新型固相萃取材料的制备及其在痕量重金属元素与形态分析中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

Retrieval of Soft Prompt Enhances Zero-Shot Task Generalization

Arxiv

0+阅读 · 2022年10月6日

MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text

Arxiv

0+阅读 · 2022年10月6日

Training from a Better Start Point: Active Self-Semi-Supervised Learning for Few Labeled Samples

Arxiv

0+阅读 · 2022年10月5日

Prototypical Calibration for Few-shot Learning of Language Models

Arxiv

0+阅读 · 2022年10月5日

Recitation-Augmented Language Models

Arxiv

0+阅读 · 2022年10月4日

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation

Arxiv

0+阅读 · 2022年10月3日

What Makes Pre-trained Language Models Better Zero/Few-shot Learners?

Arxiv

0+阅读 · 2022年9月30日

Compositional Semantic Parsing with Large Language Models

Arxiv

0+阅读 · 2022年9月30日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

VIP会员

文章信息

相关主题

语言模型化

知识 (knowledge)

小样本学习

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

零样本文本分类，Zero-Shot Learning for Text Classification

零样本文本分类，Zero-Shot Learning for Text Classification

专知会员服务

97+阅读 · 2020年5月31日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Stabilizing Transformers for Reinforcement Learning

Stabilizing Transformers for Reinforcement Learning

专知会员服务

60+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Industry Talk2

【ICIG2021】Latest News & Announcements of the Industry Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年7月29日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

相关论文

Retrieval of Soft Prompt Enhances Zero-Shot Task Generalization

Arxiv

0+阅读 · 2022年10月6日

MuRAG: Multimodal Retrieval-Augmented Generator for Open Question Answering over Images and Text

Arxiv

0+阅读 · 2022年10月6日

Training from a Better Start Point: Active Self-Semi-Supervised Learning for Few Labeled Samples

Arxiv

0+阅读 · 2022年10月5日

Prototypical Calibration for Few-shot Learning of Language Models

Arxiv

0+阅读 · 2022年10月5日

Recitation-Augmented Language Models

Arxiv

0+阅读 · 2022年10月4日

Context-Tuning: Learning Contextualized Prompts for Natural Language Generation

Arxiv

0+阅读 · 2022年10月3日

What Makes Pre-trained Language Models Better Zero/Few-shot Learners?

Arxiv

0+阅读 · 2022年9月30日

Compositional Semantic Parsing with Large Language Models

Arxiv

0+阅读 · 2022年9月30日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

Learning beyond datasets: Knowledge Graph Augmented Neural Networks for Natural language Processing

Arxiv

11+阅读 · 2018年2月16日

相关基金

金属氧化物/金刚石异质结制备及其性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

磁性碳纳米管的原位制备机理及性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

利用离子阱颗粒质谱对微球表面吸附的定量表征

国家自然科学基金

0+阅读 · 2013年12月31日

Fe、Co、Ni超细纳米结构制备与催化放氢研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于微流控芯片的细胞间药物转运与化学信号传递研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于LOC的微米级油液颗粒污染物区分检测机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

可吸入矿物细颗粒与常见菌的近尺寸作用研究

国家自然科学基金

0+阅读 · 2011年12月31日

PCOS患者卵巢颗粒细胞对卵子及早期胚胎发育潜能的基因调控

国家自然科学基金

0+阅读 · 2011年12月31日

添加新型富钡相RE242制备高超导性能REBCO块材

国家自然科学基金

0+阅读 · 2011年12月31日

新型固相萃取材料的制备及其在痕量重金属元素与形态分析中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员