大语言模型知道人类知道什么吗? (Do Large Language Models know what humans know?) - 专知论文

会员服务 ·

0

语言模型化 · MoDELS · Analysis · Performer · state-of-the-art ·

2022 年 9 月 4 日

Do Large Language Models know what humans know?

翻译：大语言模型知道人类知道什么吗?

Sean Trott,Cameron Jones,Tyler Chang,James Michaelov,Benjamin Bergen

Humans can attribute mental states to others, a capacity known as Theory of Mind. However, it is unknown to what extent this ability results from an innate biological endowment or from experience accrued through child development, particularly exposure to language describing others' mental states. We test the viability of the language exposure hypothesis by assessing whether models exposed to large quantities of human language develop evidence of Theory of Mind. In a pre-registered analysis, we present a linguistic version of the False Belief Task, widely used to assess Theory of Mind, to both human participants and a state-of-the-art Large Language Model, GPT-3. Both are sensitive to others' beliefs, but the language model does not perform as well as the humans, nor does it explain the full extent of their behavior, despite being exposed to more language than a human would in a lifetime. This suggests that while language exposure may in part explain how humans develop Theory of Mind, other mechanisms are also responsible.

翻译：人类可以将精神状态归结于他人,一种称为“思想理论”的能力。然而,尚不清楚这种能力在多大程度上来自天生的生物天赋或通过儿童发育积累的经验,特别是接触描述他人精神状态的语言。我们通过评估接触大量人类语言的模型是否发展了精神理论的证据,检验语言暴露假设的可行性。在预先登记的分析中,我们向人类参与者和最先进的大语言模型GPT-3提供了广泛用于评估思想理论的语言版本。两者都对他人的信仰敏感,但语言模型的表现和人的表现都不尽人意,也没有解释其行为的全部程度,尽管在一生中接触的语言多于人的意愿。这说明语言暴露可能部分地解释人类如何发展思想理论,但其他机制也有责任。

0

相关内容

语言模型化

语言模型化

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

104+阅读 · 2019年10月9日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

17+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

基于稀有事件模拟技术的金融衍生品组合风险度量及应用研究

国家自然科学基金

0+阅读 · 2014年12月31日

Forward-Looking与Backward-Looking相结合的投资组合管理

国家自然科学基金

1+阅读 · 2014年12月31日

非线性偏微分方程解的渐近性态研究

国家自然科学基金

0+阅读 · 2014年12月31日

含阶梯梯度泡沫金属的陶瓷复合装甲动态力学响应研究

国家自然科学基金

0+阅读 · 2012年12月31日

Frénd相变热力学模型发展方程组整体解及其渐近性态

国家自然科学基金

0+阅读 · 2012年12月31日

滇西老厂富银红土型锰矿次生富集机制及40Ar/39Ar年龄

国家自然科学基金

0+阅读 · 2012年12月31日

MDSCs在动脉粥样硬化中的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

从巨噬细胞中LXR-CCR7交互作用探讨丹参素抗动脉粥样硬化机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于图域几何PDE与特征不变量的离散曲面处理

国家自然科学基金

0+阅读 · 2009年12月31日

金融资产变结构波动的非参数GARCH建模及其应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

Large Language Models for Multi-label Propaganda Detection

Arxiv

0+阅读 · 2022年10月20日

Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy

Arxiv

0+阅读 · 2022年10月20日

Are Large Pre-Trained Language Models Leaking Your Personal Information?

Arxiv

0+阅读 · 2022年10月20日

Understanding Jargon: Combining Extraction and Generation for Definition Modeling

Arxiv

0+阅读 · 2022年10月20日

TabLLM: Few-shot Classification of Tabular Data with Large Language Models

Arxiv

0+阅读 · 2022年10月19日

Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information

Arxiv

0+阅读 · 2022年10月19日

Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models

Arxiv

0+阅读 · 2022年10月19日

PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation

Arxiv

0+阅读 · 2022年10月19日

On the Importance of Architectures and Hyperparameters for Fairness in Face Recognition

On the Importance of Architectures and Hyperparameters for Fairness in Face Recognition

Arxiv

0+阅读 · 2022年10月18日

Revisiting Contextual Toxicity Detection in Conversations

Arxiv

0+阅读 · 2022年10月18日

VIP会员

文章信息

相关主题

语言模型化

state-of-the-art

相关VIP内容

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

181+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

104+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

Al Agent：AI时代的软件革命

大型语言模型推理引擎的综述：优化与效率的视角

【ICML2025】关于语言模型对齐中奖励模型稳健性的研究

【阿姆斯特丹博士论文】终端设备上的高效深度学习推理

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium7

中国图象图形学学会CSIG

0+阅读 · 2021年11月15日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

17+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Large Language Models for Multi-label Propaganda Detection

Arxiv

0+阅读 · 2022年10月20日

Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy

Arxiv

0+阅读 · 2022年10月20日

Are Large Pre-Trained Language Models Leaking Your Personal Information?

Arxiv

0+阅读 · 2022年10月20日

Understanding Jargon: Combining Extraction and Generation for Definition Modeling

Arxiv

0+阅读 · 2022年10月20日

TabLLM: Few-shot Classification of Tabular Data with Large Language Models

Arxiv

0+阅读 · 2022年10月19日

Towards Procedural Fairness: Uncovering Biases in How a Toxic Language Classifier Uses Sentiment Information

Arxiv

0+阅读 · 2022年10月19日

Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models

Arxiv

0+阅读 · 2022年10月19日

PLATO-XL: Exploring the Large-scale Pre-training of Dialogue Generation

Arxiv

0+阅读 · 2022年10月19日

On the Importance of Architectures and Hyperparameters for Fairness in Face Recognition

On the Importance of Architectures and Hyperparameters for Fairness in Face Recognition

Arxiv

0+阅读 · 2022年10月18日

Revisiting Contextual Toxicity Detection in Conversations

Arxiv

0+阅读 · 2022年10月18日

相关基金

基于稀有事件模拟技术的金融衍生品组合风险度量及应用研究

国家自然科学基金

0+阅读 · 2014年12月31日

Forward-Looking与Backward-Looking相结合的投资组合管理

国家自然科学基金

1+阅读 · 2014年12月31日

非线性偏微分方程解的渐近性态研究

国家自然科学基金

0+阅读 · 2014年12月31日

含阶梯梯度泡沫金属的陶瓷复合装甲动态力学响应研究

国家自然科学基金

0+阅读 · 2012年12月31日

Frénd相变热力学模型发展方程组整体解及其渐近性态

国家自然科学基金

0+阅读 · 2012年12月31日

滇西老厂富银红土型锰矿次生富集机制及40Ar/39Ar年龄

国家自然科学基金

0+阅读 · 2012年12月31日

MDSCs在动脉粥样硬化中的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

从巨噬细胞中LXR-CCR7交互作用探讨丹参素抗动脉粥样硬化机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于图域几何PDE与特征不变量的离散曲面处理

国家自然科学基金

0+阅读 · 2009年12月31日

金融资产变结构波动的非参数GARCH建模及其应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员