自我教学:用自生成指令调整语言模式 (Self-Instruct: Aligning Language Model with Self Generated Instructions) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · GPT3 · 原点 · tuning ·

2022 年 12 月 20 日

Self-Instruct: Aligning Language Model with Self Generated Instructions

翻译：自我教学:用自生成指令调整语言模式

Yizhong Wang,Yeganeh Kordi,Swaroop Mishra,Alisa Liu,Noah A. Smith,Daniel Khashabi,Hannaneh Hajishirzi

from arxiv, work in progress

Large "instruction-tuned" language models (finetuned to respond to instructions) have demonstrated a remarkable ability to generalize zero-shot to new tasks. Nevertheless, they depend heavily on human-written instruction data that is limited in quantity, diversity, and creativity, therefore hindering the generality of the tuned model. We introduce Self-Instruct, a framework for improving the instruction-following capabilities of pretrained language models by bootstrapping off its own generations. Our pipeline generates instruction, input, and output samples from a language model, then prunes them before using them to finetune the original model. Applying our method to vanilla GPT3, we demonstrate a 33% absolute improvement over the original model on Super-NaturalInstructions, on par with the performance of InstructGPT_001, which is trained with private user data and human annotations. For further evaluation, we curate a set of expert-written instructions for novel tasks, and show through human evaluation that tuning GPT3 with Self-Instruct outperforms using existing public instruction datasets by a large margin, leaving only a 5% absolute gap behind InstructGPT_001. Self-Instruct provides an almost annotation-free method for aligning pre-trained language models with instructions, and we release our large synthetic dataset to facilitate future studies on instruction tuning.

翻译：大型的“ 教化” 语言模型( 与指示相适应 ) 展示出将零点推广为新任务的巨大能力。然而,它们在很大程度上依赖于数量、多样性和创造性有限的人写教学数据,因此阻碍了调制模式的普遍性。我们引入了自我教学框架,即通过从自己几代人身上穿靴而提高预先训练的语言模型教学- 执行能力的框架。我们的管道从一种语言模型中生成教学、输入和产出样本,然后在使用它们来微化原始模型之前对它们进行精细化。将我们的方法应用到Vanilla GPT3, 我们展示了超自然教学原模型的33%的绝对改进,这与SantGPT_ 001的性能相当,它受过私人用户数据和人文说明的培训。为了进一步评估,我们为新任务制定了一套专家编写的指令,并通过人类评估显示,用自导3 校正以原始模型来微化。我们将现有的公共教学数据集应用到一个大边缘, 仅留下一个完全的指令的绝对值改进了33%的版本, 将“ ” 向后方向调整一个大方向, 提供一个完整的系统前。

1

相关内容

语言模型化

语言模型化

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

专知会员服务

84+阅读 · 2020年11月25日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

102+阅读 · 2020年6月21日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

肥胖相关Hepatokine LECT2在肝脏中的调控及机制

国家自然科学基金

1+阅读 · 2015年12月31日

脂联素通过p38 MAPK-STAT5途径调节URSA中Th17/Treg失衡的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

LncRNA HOTTIP调控整合素信号通路参与DDH关节软骨纤维化的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

ERK3介导TNF-α调控头颈鳞癌淋巴管生成的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

应力敏感条件下的数字岩心微观渗流分析方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

等井径膨胀套管螺纹接头的结构完整性研究

国家自然科学基金

0+阅读 · 2013年12月31日

1111结构铁基超导材料的高压核磁共振研究

国家自然科学基金

0+阅读 · 2012年12月31日

αctinin 4介导NHERF1调节细胞微丝骨架及其对肿瘤细胞黏附与迁移的影响

国家自然科学基金

0+阅读 · 2011年12月31日

基于改性赤泥质多孔陶瓷滤料的混凝-聚结过滤耦合处理油田采出水及作用机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

Self-supervised Cloth Reconstruction via Action-conditioned Cloth Tracking

Arxiv

1+阅读 · 2023年2月19日

Learning Language Representations with Logical Inductive Bias

Arxiv

0+阅读 · 2023年2月19日

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

Arxiv

0+阅读 · 2023年2月18日

Stable Deep MRI Reconstruction using Generative Priors

Arxiv

0+阅读 · 2023年2月17日

Multimodal Subtask Graph Generation from Instructional Videos

Arxiv

0+阅读 · 2023年2月17日

InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis

Arxiv

0+阅读 · 2023年2月16日

Efficient 3D Object Reconstruction using Visual Transformers

Arxiv

0+阅读 · 2023年2月16日

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Arxiv

1+阅读 · 2023年2月16日

From Show to Tell: A Survey on Image Captioning

Arxiv

15+阅读 · 2021年7月14日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

VIP会员

文章信息

相关主题

语言模型化

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

【UIUC】最新《自监督学习》教程，51页ppt，Self-supervised learning

专知会员服务

84+阅读 · 2020年11月25日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

102+阅读 · 2020年6月21日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

【深度学习表格检测、信息提取和结构化】《Table Detection, Information Extraction and Structuring using Deep Learning》by Vihar Kurama

专知会员服务

38+阅读 · 2020年1月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国海军陆战队软件定义网络应用案例：分布式防火墙自动化系统》148页

《多体环境下定位导航授时（PNT）系统研究》228页

软件定义无线电（SDR）：商业与军事领域的技术、应用及未来趋势

《攻势防空作战中无人追击者/规避者最优轨迹研究（含动态交战区建模）》95页

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

ACM TOMM Call for Papers

ACM TOMM Call for Papers

CCF多媒体专委会

2+阅读 · 2022年3月23日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium6

中国图象图形学学会CSIG

2+阅读 · 2021年11月12日

【ICIG2021】Latest News & Announcements of the Industry Talk1

【ICIG2021】Latest News & Announcements of the Industry Talk1

中国图象图形学学会CSIG

0+阅读 · 2021年7月28日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

vae 相关论文表示学习 1

vae 相关论文表示学习 1

CreateAMind

12+阅读 · 2018年9月6日

相关论文

Self-supervised Cloth Reconstruction via Action-conditioned Cloth Tracking

Arxiv

1+阅读 · 2023年2月19日

Learning Language Representations with Logical Inductive Bias

Arxiv

0+阅读 · 2023年2月19日

Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation

Arxiv

0+阅读 · 2023年2月18日

Stable Deep MRI Reconstruction using Generative Priors

Arxiv

0+阅读 · 2023年2月17日

Multimodal Subtask Graph Generation from Instructional Videos

Arxiv

0+阅读 · 2023年2月17日

InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis

Arxiv

0+阅读 · 2023年2月16日

Efficient 3D Object Reconstruction using Visual Transformers

Arxiv

0+阅读 · 2023年2月16日

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Arxiv

1+阅读 · 2023年2月16日

From Show to Tell: A Survey on Image Captioning

Arxiv

15+阅读 · 2021年7月14日

Making Pre-trained Language Models Better Few-shot Learners

Arxiv

14+阅读 · 2020年12月31日

相关基金

肥胖相关Hepatokine LECT2在肝脏中的调控及机制

国家自然科学基金

1+阅读 · 2015年12月31日

脂联素通过p38 MAPK-STAT5途径调节URSA中Th17/Treg失衡的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

LncRNA HOTTIP调控整合素信号通路参与DDH关节软骨纤维化的分子机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

ERK3介导TNF-α调控头颈鳞癌淋巴管生成的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

应力敏感条件下的数字岩心微观渗流分析方法研究

国家自然科学基金

0+阅读 · 2014年12月31日

PPAR β/δ基因在结直肠癌血管生成调控中的作用及分子机理

国家自然科学基金

2+阅读 · 2014年12月31日

等井径膨胀套管螺纹接头的结构完整性研究

国家自然科学基金

0+阅读 · 2013年12月31日

1111结构铁基超导材料的高压核磁共振研究

国家自然科学基金

0+阅读 · 2012年12月31日

αctinin 4介导NHERF1调节细胞微丝骨架及其对肿瘤细胞黏附与迁移的影响

国家自然科学基金

0+阅读 · 2011年12月31日

基于改性赤泥质多孔陶瓷滤料的混凝-聚结过滤耦合处理油田采出水及作用机制研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员