Why think step-by-step? Reasoning emerges from the locality of experience - Zhuanzhi (专知) Papers


Locality · Training data · Language models · Matching conditions · Statistical structure

April 7, 2023

Why think step-by-step? Reasoning emerges from the locality of experience


Ben Prystawski, Noah D. Goodman

From arXiv, 8 pages, 3 figures

Humans have a powerful and mysterious capacity to reason. By working through a series of purely mental steps, we can make inferences we would not be capable of making directly, despite the fact that we get no additional data from the world. Similarly, large language models can perform better at complex tasks through chain-of-thought reasoning, where they generate intermediate steps before answering a question. We use language models to investigate the questions of when and why reasoning is helpful, testing the hypothesis that reasoning is effective when training data consists of local clusters of variables that influence each other strongly. These training conditions enable the chaining of accurate local inferences in order to estimate relationships between variables that were not seen together in training. We train an autoregressive transformer on samples from joint distributions defined by Bayes nets, but only include a subset of all the variables in each sample. We compare language models' ability to match conditional probabilities both with and without intermediate reasoning steps, finding that intermediate steps help only when the training data is locally structured with respect to dependencies between variables. Furthermore, intermediate variables need to be relevant to the relationship between observed information and target inferences. Our results illustrate how the statistical structure of training data drives the effectiveness of reasoning step by step.
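The chaining idea in the abstract can be illustrated with a toy example. The sketch below is not the authors' setup: it assumes a hypothetical three-variable chain A -> B -> C with made-up probabilities, and it replaces the autoregressive transformer with plain count-based estimates. Each training sample reveals only a local pair, (A, B) or (B, C), so A and C are never observed together, yet P(C | A) can still be estimated by chaining the two local conditionals through the intermediate variable B.

```python
import random
from collections import Counter

def sample_chain(rng):
    """Sample (A, B, C) from a toy chain Bayes net A -> B -> C (assumed parameters)."""
    a = int(rng.random() < 0.5)
    b = int(rng.random() < (0.9 if a else 0.1))
    c = int(rng.random() < (0.8 if b else 0.2))
    return a, b, c

def local_observations(n, seed=0):
    """Each training sample reveals only one local pair: (A, B) or (B, C)."""
    rng = random.Random(seed)
    ab, bc = Counter(), Counter()
    for _ in range(n):
        a, b, c = sample_chain(rng)
        if rng.random() < 0.5:
            ab[(a, b)] += 1
        else:
            bc[(b, c)] += 1
    return ab, bc

def conditional(counts, given, value):
    """Estimate P(second = value | first = given) from pair counts."""
    total = counts[(given, 0)] + counts[(given, 1)]
    return counts[(given, value)] / total

def chained_estimate(ab, bc, a):
    """Estimate P(C=1 | A=a) by marginalizing over the intermediate variable B."""
    return sum(conditional(ab, a, b) * conditional(bc, b, 1) for b in (0, 1))

ab, bc = local_observations(200_000)
estimate = chained_estimate(ab, bc, a=1)
exact = 0.9 * 0.8 + 0.1 * 0.2  # true P(C=1 | A=1) under the assumed parameters
print(f"chained: {estimate:.3f}, exact: {exact:.3f}")
```

A direct estimate of P(C | A) is impossible here because no sample contains both A and C. The chained estimate works only because B is relevant to the relationship between A and C, which mirrors the abstract's point that intermediate variables need to be relevant to the target inference.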

