语文模型 (Recitation-Augmented Language Models) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · state-of-the-art · HotpotQA · Performer ·

2022 年 10 月 4 日

Recitation-Augmented Language Models

翻译：语文模型

Zhiqing Sun,Xuezhi Wang,Yi Tay,Yiming Yang,Denny Zhou

We propose a new paradigm to help Large Language Models (LLMs) generate more accurate factual knowledge without retrieving from an external corpus, called RECITation-augmented gEneration (RECITE). Different from retrieval-augmented language models that retrieve relevant documents before generating the outputs, given an input, RECITE first recites one or several relevant passages from LLMs' own memory via sampling, and then produces the final answers. We show that RECITE is a powerful paradigm for knowledge-intensive NLP tasks. Specifically, we show that by utilizing recitation as the intermediate step, a recite-and-answer scheme can achieve new state-of-the-art performance in various closed-book question answering (CBQA) tasks. In experiments, we verify the effectiveness of RECITE on three pre-trained models (PaLM, UL2, and OPT) and three CBQA tasks (Natural Questions, TriviaQA, and HotpotQA).

翻译：我们提出了一个新的范式,以帮助大语言模型(LLMs)产生更准确的事实知识,而无需从外部外源获取,称为RECITation-Angeled gEnergation(REGITE ) 。不同于检索强化语言模型(RECITE ), 该模型在产生产出之前检索相关文件,根据一个投入,RETET首先从LLMs自己的记忆中通过抽样读取一个或几个相关段落,然后提出最后答案。我们显示RECTE是知识密集型NLP任务的一个强有力的范例。具体地说,我们通过将回引作为中间步骤,我们表明在各种非公开问题解答(CBQA)任务中,一个读和答方案可以实现新的最新业绩。在实验中,我们核查RECTTE在三个预培训模式(PALM、UL2和IAL)和CBAM任务(Natalal Ques、TriviaQA和HotpotQA)上的有效性。

0

相关内容

语言模型化

语言模型化

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Bi/BiVO4@mSiO2三元异质结构光催化降解抗生素废水的性能及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

氢分子影响动脉粥样斑块稳定性及其巨噬细胞内质网应激凋亡途径的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

小半夏加茯苓汤诱导肿瘤细胞凋亡途径及其机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

离子液体功能化手性Bronsted酸催化剂创制及其在催化反应中的应用

国家自然科学基金

0+阅读 · 2011年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

COPEN: Probing Conceptual Knowledge in Pre-trained Language Models

Arxiv

0+阅读 · 2022年11月8日

Measuring Progress on Scalable Oversight for Large Language Models

Arxiv

1+阅读 · 2022年11月4日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning

Arxiv

27+阅读 · 2021年1月21日

Memory Augmented Graph Neural Networks for Sequential Recommendation

Memory Augmented Graph Neural Networks for Sequential Recommendation

Arxiv

13+阅读 · 2019年12月26日

VIP会员

文章信息

相关主题

语言模型化

state-of-the-art

相关VIP内容

NeurlPS 2022 | 自然语言处理相关论文分类整理

NeurlPS 2022 | 自然语言处理相关论文分类整理

专知会员服务

51+阅读 · 2022年10月2日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【博士论文】面向真实世界音视联合语音识别的可扩展框架

《通过仿真与开源数据提升战略决策：机遇与局限》最新报告

【AAAI2026】善始则事半功倍：基于前缀优化的大语言模型推理强化学习

评估大语言模型在科学发现中的作用

相关资讯

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

征稿 | CFP：Special Issue of NLP and KG(JCR Q2，IF2.67)

开放知识图谱

1+阅读 · 2022年4月4日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

COPEN: Probing Conceptual Knowledge in Pre-trained Language Models

Arxiv

0+阅读 · 2022年11月8日

Measuring Progress on Scalable Oversight for Large Language Models

Arxiv

1+阅读 · 2022年11月4日

Learning Neural Models for Natural Language Processing in the Face of Distributional Shift

Arxiv

11+阅读 · 2021年9月3日

KG-BART: Knowledge Graph-Augmented BART for Generative Commonsense Reasoning

Arxiv

27+阅读 · 2021年1月21日

Memory Augmented Graph Neural Networks for Sequential Recommendation

Memory Augmented Graph Neural Networks for Sequential Recommendation

Arxiv

13+阅读 · 2019年12月26日

相关基金

Bi/BiVO4@mSiO2三元异质结构光催化降解抗生素废水的性能及机理研究

国家自然科学基金

0+阅读 · 2015年12月31日

氢分子影响动脉粥样斑块稳定性及其巨噬细胞内质网应激凋亡途径的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

小半夏加茯苓汤诱导肿瘤细胞凋亡途径及其机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

离子液体功能化手性Bronsted酸催化剂创制及其在催化反应中的应用

国家自然科学基金

0+阅读 · 2011年12月31日

Unscented卡尔曼滤波算法及其在通信中的应用

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员