Reprompting: Automated Chain-of-Thought Prompt Inference Through Gibbs Sampling

May 17, 2023

Weijia Xu, Andrzej Banburski-Fahey, Nebojsa Jojic

We introduce Reprompting, an iterative sampling algorithm that searches for Chain-of-Thought (CoT) recipes for a given task without human intervention. Through Gibbs sampling, we infer CoT recipes that work consistently well for a set of training samples. Our method iteratively samples new recipes using previously sampled solutions as parent prompts to solve other training problems. On five Big-Bench Hard tasks that require multi-step reasoning, Reprompting achieves consistently better performance than the zero-shot, few-shot, and human-written CoT baselines. Reprompting can also facilitate transfer of knowledge from a stronger model to a weaker model, leading to substantially improved performance of the weaker model. Overall, Reprompting brings improvements of up to 17 points over the previous state-of-the-art method that uses human-written CoT prompts.
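
The sampling loop described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not specify the initialization, the number of parent prompts, or the acceptance rule, so the zero-shot initialization, the `k_parents` parameter, the keep-only-if-correct rule, and the fixed sweep order over problems are all assumptions made here for illustration. The names `reprompt_sketch` and `toy_model` are hypothetical, and `toy_model` is a deterministic stand-in for a language model that only solves harder sums when the prompt contains a worked demonstration.

```python
import random
from typing import Callable, List, Optional, Tuple

Problem = Tuple[str, str]                  # (question, gold answer)
Model = Callable[[str], Tuple[str, str]]   # prompt -> (solution text, answer)


def toy_model(prompt: str) -> Tuple[str, str]:
    """Hypothetical stand-in for an LLM (assumption, not a real API).

    It adds two integers, but gets sums with a two-digit operand right
    only when the prompt already contains a worked demonstration.
    """
    question = prompt.rsplit("Q: ", 1)[1].split("\n", 1)[0]
    a, b = (int(x) for x in question.split(" + "))
    has_demo = prompt.count("Q: ") > 1
    total = a + b if has_demo or (a < 10 and b < 10) else 0
    return f"Add {a} and {b} to get {total}. Answer: {total}", str(total)


def reprompt_sketch(
    problems: List[Problem],
    model: Model,
    n_sweeps: int = 3,
    k_parents: int = 2,
    seed: int = 0,
) -> List[Optional[str]]:
    """Gibbs-style resampling of one recipe per training problem."""
    rng = random.Random(seed)
    recipes: List[Optional[str]] = [None] * len(problems)

    # Initialization (assumption): try each problem zero-shot and keep
    # the solution only if it reaches the gold answer.
    for j, (question, gold) in enumerate(problems):
        text, answer = model(f"Q: {question}\nA:")
        if answer == gold:
            recipes[j] = f"Q: {question}\nA: {text}"

    for _ in range(n_sweeps):
        # Fixed sweep order over problems (assumption, chosen for determinism).
        for j, (question, gold) in enumerate(problems):
            # Parent prompts come from recipes sampled for the OTHER problems.
            pool = [r for i, r in enumerate(recipes) if i != j and r is not None]
            if not pool:
                continue
            parents = rng.sample(pool, min(k_parents, len(pool)))
            prompt = "\n\n".join(parents) + f"\n\nQ: {question}\nA:"
            text, answer = model(prompt)
            # Acceptance rule (assumption): keep the new recipe only if correct.
            if answer == gold:
                recipes[j] = f"Q: {question}\nA: {text}"
    return recipes


PROBLEMS: List[Problem] = [("2 + 3", "5"), ("12 + 30", "42"), ("25 + 17", "42")]

if __name__ == "__main__":
    for recipe in reprompt_sketch(PROBLEMS, toy_model):
        print(recipe, end="\n\n")
```

In this toy setting only the first problem is solved zero-shot. Its solution then serves as a parent prompt for the other two, which mirrors the idea of reusing previously sampled solutions to solve other training problems. Replacing `toy_model` with a call to a real model is left to the reader.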
