Large language models (LMs) of code have recently shown tremendous promise in completing code and synthesizing code from natural language descriptions. However, the current state-of-the-art code LMs (e.g., Codex (Chen et al., 2021)) are not publicly available, leaving many questions about their model and data design decisions. We aim to fill in some of these blanks through a systematic evaluation of the largest existing models: Codex, GPT-J, GPT-Neo, GPT-NeoX-20B, and CodeParrot, across various programming languages. Although Codex itself is not open-source, we find that existing open-source models achieve close results in some programming languages, despite being targeted mainly at natural language modeling. We further identify an important missing piece: a large open-source model trained exclusively on a multi-lingual corpus of code. We release a new model, PolyCoder, with 2.7B parameters based on the GPT-2 architecture, trained on 249GB of code across 12 programming languages on a single machine. In the C programming language, PolyCoder outperforms all models, including Codex. Our trained models are open-source and publicly available at https://github.com/VHellendoorn/Code-LMs, which enables future research and application in this area.
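As a minimal sketch of how the released checkpoints might be used for left-to-right code completion, the snippet below samples a continuation from a GPT-2-style causal LM via the Hugging Face `transformers` library. The model identifier shown here is an assumption for illustration; the authoritative release artifacts and loading instructions are in the repository linked above.

```python
# Hedged sketch: sampling a code completion from a GPT-2-style code LM.
# The checkpoint name below is an assumed Hugging Face mirror of PolyCoder;
# consult https://github.com/VHellendoorn/Code-LMs for the canonical release.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "NinedayWang/PolyCoder-2.7B"  # assumption, not confirmed by the abstract

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

# Prompt with the start of a C function; the model completes the body.
prompt = "int binary_search(int *arr, int n, int target) {"
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(
    **inputs,
    max_new_tokens=64,
    do_sample=True,
    temperature=0.2,  # low temperature is a common choice for code completion
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```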