Using Large Language Models to Simulate Multiple Humans and Replicate Human Subject Studies

February 14, 2023

Gati Aher, Rosa I. Arriaga, Adam Tauman Kalai

From arXiv. Comment: Added Turing Experiment (TE) framing and Wisdom of Crowds TE

We introduce a new type of test, called a Turing Experiment (TE), for evaluating how well a language model, such as GPT-3, can simulate different aspects of human behavior. Unlike the Turing Test, which involves simulating a single arbitrary individual, a TE requires simulating a representative sample of participants in human subject research. We give TEs that attempt to replicate well-established findings in prior studies. We design a methodology for simulating TEs and illustrate its use to compare how well different language models are able to reproduce classic economic, psycholinguistic, and social psychology experiments: Ultimatum Game, Garden Path Sentences, Milgram Shock Experiment, and Wisdom of Crowds. In the first three TEs, the existing findings were replicated using recent models, while the last TE reveals a "hyper-accuracy distortion" present in some language models.
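As a rough illustration of what simulating a TE could look like, the sketch below runs an Ultimatum Game style loop over a small set of simulated participants and reports the acceptance rate per offer. Everything in it is an assumption made for illustration: the participant names, the offer values, the prompt wording, and the `toy_model` stand-in (a seeded random rule, not a language model) do not come from the paper, and the function names `build_prompt` and `run_te` are hypothetical.

```python
import random
from collections import defaultdict
from typing import Callable, Dict, List

# Hypothetical participant names and offers; illustrative values only.
PARTICIPANTS: List[str] = ["Ms. Garcia", "Mr. Chen", "Ms. Okafor", "Mr. Novak"]
OFFERS: List[int] = [1, 3, 5]  # dollars offered out of a 10-dollar pot


def build_prompt(name: str, offer: int) -> str:
    """Compose one Ultimatum Game record for a simulated participant."""
    return (
        f"{name} is offered ${offer} out of $10 by another player. "
        f"{name} must accept or reject. {name} decides to"
    )


def toy_model(prompt: str, rng: random.Random) -> str:
    """Stand-in for a language model call (assumption: NOT a real model or API).

    Accepts with probability offer / 5, so larger offers are accepted more often.
    """
    offer = int(prompt.split("$")[1].split(" ")[0])
    return "accept" if rng.random() < offer / 5 else "reject"


def run_te(
    model: Callable[[str], str],
    participants: List[str],
    offers: List[int],
) -> Dict[int, float]:
    """Query the model once per (participant, offer) pair; return acceptance rates."""
    accepted: Dict[int, int] = defaultdict(int)
    for name in participants:
        for offer in offers:
            completion = model(build_prompt(name, offer))
            if completion.strip().lower().startswith("accept"):
                accepted[offer] += 1
    return {offer: accepted[offer] / len(participants) for offer in offers}


# Usage: share one seeded generator so the run is repeatable.
rng = random.Random(0)
rates = run_te(lambda p: toy_model(p, rng), PARTICIPANTS, OFFERS)
print(rates)
```

The point of the design is that the unit of evaluation is the aggregate over many simulated participants rather than a single response, which is what distinguishes a TE from a Turing Test in the abstract above. Swapping `toy_model` for a real model call would leave `run_te` unchanged.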
