ChatGPT参加计算机科学考试 (ChatGPT Participates in a Computer Science Exam) - 专知论文

会员服务 ·

0

ChatGPT · 计算机科学 · Performer · HTTPS · 可理解性 ·

2023 年 3 月 8 日

ChatGPT Participates in a Computer Science Exam

翻译：ChatGPT参加计算机科学考试

Sebastian Bordt,Ulrike von Luxburg

We asked ChatGPT to participate in an undergraduate computer science exam on ''Algorithms and Data Structures''. We evaluated the program on the entire exam as posed to the students. We hand-copied its answers onto an exam sheet, which was subsequently graded in a blind setup alongside those of 200 participating students. We find that ChatGPT narrowly passed the exam, obtaining 20.5 out of 40 points. This impressive performance indicates that ChatGPT can indeed succeed in challenging tasks like university exams. At the same time, the tasks in our exam are structurally similar to those on other exams, solved homework problems, and teaching materials that can be found online. Therefore, it would be premature to conclude from this experiment that ChatGPT has any understanding of computer science. The transcript of our conversation with ChatGPT is available at \url{https://github.com/tml-tuebingen/chatgpt-algorithm-exam}, and the entire graded exam is in the appendix of this paper.

翻译：我们请ChatGPT参加了一场针对"算法和数据结构"的本科生计算机科学考试。我们对整场考试进行了评估，将ChatGPT的答案手工抄写到答题卡上，并与参加考试的200名学生的答案一同进行了盲审。我们发现ChatGPT勉强通过了考试，获得了40分中的20.5分。这一优异表现表明，ChatGPT确实可以在像大学考试这样的挑战性任务中取得成功。与此同时，我们考试中的任务在结构上与其他考试、课程作业问题以及网上可以找到的教材非常相似。因此，从这个实验中得出ChatGPT具有计算机科学理解能力的结论是过早的。我们的ChatGPT对话记录可以在\url{https://github.com/tml-tuebingen/chatgpt-algorithm-exam}中找到，整个评分过的考试可以在本文附录中找到。

0

相关内容

ChatGPT

ChatGPT（全名：Chat Generative Pre-trained Transformer），美国OpenAI 研发的聊天机器人程序 [1] ，于2022年11月30日发布。ChatGPT是人工智能技术驱动的自然语言处理工具，它能够通过学习和理解人类的语言来进行对话，还能根据聊天的上下文进行互动，真正像人类一样来聊天交流，甚至能完成撰写邮件、视频脚本、文案、翻译、代码，写论文任务。 [1] https://openai.com/blog/chatgpt/

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

机器之心

0+阅读 · 2022年10月15日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

计算机 | EMNLP 2019等国际会议信息6条

计算机 | EMNLP 2019等国际会议信息6条

Call4Papers

18+阅读 · 2019年4月26日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

麦冬皂苷通过下调lnc-MALAT1抑制NSCLC血管生成的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

EB病毒ncRNA在Burkitt淋巴瘤发病中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

肾癌的磁共振扩散峰度成像研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

旋进电子衍射的动力学校正及纳米晶的三维重构

国家自然科学基金

0+阅读 · 2012年12月31日

NLRP3炎性小体介导同型半胱氨酸诱导动脉粥样硬化炎症反应的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

糖尿病血管钙化的新机制：高糖诱导内皮细胞－成骨细胞转分化的研究

国家自然科学基金

0+阅读 · 2012年12月31日

组团参加国际光学联合会大会

国家自然科学基金

0+阅读 · 2012年8月18日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

中国9- - 18岁城市学生攻击行为评定常模研制及攻击个体社会认知的fMRI研究

国家自然科学基金

0+阅读 · 2009年12月31日

Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks

Arxiv

0+阅读 · 2023年5月8日

XAI in Computational Linguistics: Understanding Political Leanings in the Slovenian Parliament

Arxiv

0+阅读 · 2023年5月8日

Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting

Arxiv

0+阅读 · 2023年5月7日

No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation

Arxiv

0+阅读 · 2023年5月7日

Science and Technology Ontology: A Taxonomy of Emerging Topics

Arxiv

0+阅读 · 2023年5月6日

Designing Bugs or Doing Another Project: Effects on Secondary Students' Self-Beliefs in Computer Science

Arxiv

0+阅读 · 2023年5月4日

PersonaLLM: Investigating the Ability of GPT-3.5 to Express Personality Traits and Gender Differences

Arxiv

0+阅读 · 2023年5月4日

Correcting for Interference in Experiments: A Case Study at Douyin

Arxiv

0+阅读 · 2023年5月4日

Beyond case studies: Teaching data science critique and ethics through sociotechnical surveillance studies

Arxiv

0+阅读 · 2023年5月3日

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Arxiv

14+阅读 · 2019年1月17日

VIP会员

文章信息

相关主题

计算机科学

相关VIP内容

百篇论文纵览大型语言模型最新研究进展

百篇论文纵览大型语言模型最新研究进展

专知会员服务

70+阅读 · 2023年3月31日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

126+阅读 · 2022年4月21日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

2020数据工程师成长路线图

专知会员服务

19+阅读 · 2020年9月6日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

生物信息学中的生成式人工智能：模型、应用与方法学进展的系统性综述

《COA-GPT 2.0：加速军事决策流程的代理式人工智能规划工具》

【博士论文】因果机器学习中的数据质量研究：算法公平性的应用

作战计算：数学模型预测军事冲突场景——科学方法如何协助识别热点冲突区域（附论文）

相关资讯

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

Purdue电子与计算机工程系李海桐NanoX实验室招收AI硬件全奖博士生（2023秋季）

机器之心

0+阅读 · 2022年10月15日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

计算机 | 入门级EI会议ICVRIS 2019诚邀稿件

Call4Papers

10+阅读 · 2019年6月24日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

计算机 | EMNLP 2019等国际会议信息6条

计算机 | EMNLP 2019等国际会议信息6条

Call4Papers

18+阅读 · 2019年4月26日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

大数据 | 顶级SCI期刊专刊/国际会议信息7条

大数据 | 顶级SCI期刊专刊/国际会议信息7条

Call4Papers

10+阅读 · 2018年12月29日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks

Arxiv

0+阅读 · 2023年5月8日

XAI in Computational Linguistics: Understanding Political Leanings in the Slovenian Parliament

Arxiv

0+阅读 · 2023年5月8日

Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting

Arxiv

0+阅读 · 2023年5月7日

No More Manual Tests? Evaluating and Improving ChatGPT for Unit Test Generation

Arxiv

0+阅读 · 2023年5月7日

Science and Technology Ontology: A Taxonomy of Emerging Topics

Arxiv

0+阅读 · 2023年5月6日

Designing Bugs or Doing Another Project: Effects on Secondary Students' Self-Beliefs in Computer Science

Arxiv

0+阅读 · 2023年5月4日

PersonaLLM: Investigating the Ability of GPT-3.5 to Express Personality Traits and Gender Differences

Arxiv

0+阅读 · 2023年5月4日

Correcting for Interference in Experiments: A Case Study at Douyin

Arxiv

0+阅读 · 2023年5月4日

Beyond case studies: Teaching data science critique and ethics through sociotechnical surveillance studies

Arxiv

0+阅读 · 2023年5月3日

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Taking Human out of Learning Applications: A Survey on Automated Machine Learning

Arxiv

14+阅读 · 2019年1月17日

相关基金

麦冬皂苷通过下调lnc-MALAT1抑制NSCLC血管生成的机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

EB病毒ncRNA在Burkitt淋巴瘤发病中的作用及机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

肾癌的磁共振扩散峰度成像研究

国家自然科学基金

0+阅读 · 2013年12月31日

Kronheimer-Nakajima quiver 模空间与有理曲面

国家自然科学基金

1+阅读 · 2013年12月31日

旋进电子衍射的动力学校正及纳米晶的三维重构

国家自然科学基金

0+阅读 · 2012年12月31日

NLRP3炎性小体介导同型半胱氨酸诱导动脉粥样硬化炎症反应的作用及机制

国家自然科学基金

0+阅读 · 2012年12月31日

糖尿病血管钙化的新机制：高糖诱导内皮细胞－成骨细胞转分化的研究

国家自然科学基金

0+阅读 · 2012年12月31日

组团参加国际光学联合会大会

国家自然科学基金

0+阅读 · 2012年8月18日

编码密码学中若干组合对象研究

国家自然科学基金

0+阅读 · 2009年12月31日

中国9- - 18岁城市学生攻击行为评定常模研制及攻击个体社会认知的fMRI研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员