GitHub Copilot人工智能对手程序员：资产还是负责? (GitHub Copilot AI pair programmer: Asset or Liability?) - 专知论文

会员服务 ·

0

Copilot · 程序员 · GitHub · 编程 · 正确性 ·

2023 年 4 月 14 日

GitHub Copilot AI pair programmer: Asset or Liability?

翻译：GitHub Copilot人工智能对手程序员：资产还是负责?

Arghavan Moradi Dakhel,Vahid Majdinasab,Amin Nikanjam,Foutse Khomh,Michel C. Desmarais,Zhen Ming, Jiang

from arxiv, 27 pages, 8 figures

Automatic program synthesis is a long-lasting dream in software engineering. Recently, a promising Deep Learning (DL) based solution, called Copilot, has been proposed by OpenAI and Microsoft as an industrial product. Although some studies evaluate the correctness of Copilot solutions and report its issues, more empirical evaluations are necessary to understand how developers can benefit from it effectively. In this paper, we study the capabilities of Copilot in two different programming tasks: (i) generating (and reproducing) correct and efficient solutions for fundamental algorithmic problems, and (ii) comparing Copilot's proposed solutions with those of human programmers on a set of programming tasks. For the former, we assess the performance and functionality of Copilot in solving selected fundamental problems in computer science, like sorting and implementing data structures. In the latter, a dataset of programming problems with human-provided solutions is used. The results show that Copilot is capable of providing solutions for almost all fundamental algorithmic problems, however, some solutions are buggy and non-reproducible. Moreover, Copilot has some difficulties in combining multiple methods to generate a solution. Comparing Copilot to humans, our results show that the correct ratio of humans' solutions is greater than Copilot's suggestions, while the buggy solutions generated by Copilot require less effort to be repaired.

翻译：自动程序合成是软件工程中梦寐以求的目标。最近，OpenAI和Microsoft提出了一种基于深度学习的解决方案Copilot，作为一种工业产品。尽管一些研究评估了Copilot解决方案的正确性并报告了它的问题，但需要更多的经验性评估来理解开发人员如何有效地从中受益。在本文中，我们研究了Copilot在两个不同的编程任务中的能力：(i) 生成(和再现)基本算法问题的正确且高效解决方案，以及(ii) 将Copilot的提议解决方案与人类程序员的解决方案进行比较。对于前者，在计算机科学中选择的基本问题中评估了Copilot的性能和功能，例如排序和实现数据结构。对于后者，使用人类提供的解决方案的编程问题数据集。结果表明，Copilot能够为几乎所有基本算法问题提供解决方案，但一些解决方案是有缺陷且不可重现的。此外，Copilot有些难以将多种方法组合起来生成解决方案。将Copilot与人类进行比较，我们的结果显示人类的解决方案正确率大于Copilot的建议，而由Copilot生成的有缺陷的解决方案需要更少的修复工作。

0

相关内容

Copilot

【2023新书】正则表达式谜题和AI编码助手:解决了24个谜题，有或没有Copilot、ChatGPT等的帮助,147页pdf

【2023新书】正则表达式谜题和AI编码助手:解决了24个谜题，有或没有Copilot、ChatGPT等的帮助,147页pdf

专知会员服务

82+阅读 · 2023年3月8日

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

专知会员服务

58+阅读 · 2022年12月10日

【人工智能+人力资源】人力资源专业人士的工具箱，Human-Centred Artificial Intelligence for Human Resources: A Toolkit for Human Resources Professionals

【人工智能+人力资源】人力资源专业人士的工具箱，Human-Centred Artificial Intelligence for Human Resources: A Toolkit for Human Resources Professionals

专知会员服务

29+阅读 · 2022年2月17日

终究还是来了，AI卷革程序员！！DeepMind发布媲美普通程序员的AlphaCode

终究还是来了，AI卷革程序员！！DeepMind发布媲美普通程序员的AlphaCode

专知会员服务

27+阅读 · 2022年2月3日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

103+阅读 · 2020年6月21日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

TensorFlow 2.0 学习资源汇总

TensorFlow 2.0 学习资源汇总

专知会员服务

67+阅读 · 2019年10月9日

让程序员动嘴写代码，Copilot测试新功能「嘿，GitHub！」

让程序员动嘴写代码，Copilot测试新功能「嘿，GitHub！」

机器之心

0+阅读 · 2022年11月10日

程序员早下班的编码神器 GitHub Copilot，遭 90 亿美元的集体诉讼！

程序员早下班的编码神器 GitHub Copilot，遭 90 亿美元的集体诉讼！

CSDN

1+阅读 · 2022年11月7日

八个不容错过的 GitHub Copilot 功能！

八个不容错过的 GitHub Copilot 功能！

CSDN

11+阅读 · 2022年9月22日

用 20+ 行 JavaScript 代码，短暂“变身” iOS 程序员！

用 20+ 行 JavaScript 代码，短暂“变身” iOS 程序员！

CSDN

0+阅读 · 2022年9月7日

AI帮写代码67元/月！

AI帮写代码67元/月！

夕小瑶的卖萌屋

0+阅读 · 2022年6月27日

CALDERA 一款对手自动模拟工具

CALDERA 一款对手自动模拟工具

黑白之道

20+阅读 · 2019年9月17日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

PyTorch自然语言处理实战（附详细代码下载）

PyTorch自然语言处理实战（附详细代码下载）

专知

67+阅读 · 2019年2月12日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

无线传感器网络分布式安全时钟同步算法研究

国家自然科学基金

0+阅读 · 2014年12月31日

SR蛋白介导肿瘤发生的功能和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

化痰通脉饮对PCOS的IRS-1-PI3K/AKT/NF-κB串流失控的调节效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

金融与管理中的HJB方程组的高效有限元方法

国家自然科学基金

0+阅读 · 2013年12月31日

自组装纳米颗粒单层膜／溅射薄膜异质结构的制备与磁光性质的研究

国家自然科学基金

0+阅读 · 2013年12月31日

交互式Petri网及其兼容性研究

国家自然科学基金

0+阅读 · 2012年12月31日

ATM介导自噬分子Beclin1磷酸化修饰的新功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

NUAK1介导LKB1信号通路与PTEN信号通路相互作用的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于iPDMS和SI-ATRP的静电纺丝表面功能化研究

国家自然科学基金

0+阅读 · 2009年12月31日

语言环境下群体共识过程的优化研究

国家自然科学基金

0+阅读 · 2008年12月31日

Is Model Attention Aligned with Human Attention? An Empirical Study on Large Language Models for Code Generation

Arxiv

0+阅读 · 2023年6月2日

A New Algebraic Approach for String Reconstruction from Substring Compositions

Arxiv

0+阅读 · 2023年6月1日

Automatic Emotion Experiencer Recognition

Arxiv

0+阅读 · 2023年6月1日

AI for Low-Code for AI

Arxiv

1+阅读 · 2023年5月31日

Reduced order models for the buckling of hyperelastic beams

Arxiv

0+阅读 · 2023年5月31日

AI for Next Generation Computing: Emerging Trends and Future Directions

Arxiv

19+阅读 · 2022年3月5日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Medical Visual Question Answering: A Survey

Arxiv

15+阅读 · 2021年11月19日

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Arxiv

16+阅读 · 2020年8月10日

Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition

Arxiv

10+阅读 · 2018年4月28日

VIP会员

文章信息

相关主题

相关VIP内容

【2023新书】正则表达式谜题和AI编码助手:解决了24个谜题，有或没有Copilot、ChatGPT等的帮助,147页pdf

【2023新书】正则表达式谜题和AI编码助手:解决了24个谜题，有或没有Copilot、ChatGPT等的帮助,147页pdf

专知会员服务

82+阅读 · 2023年3月8日

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

【干货书】Python强化学习算法:学习、理解和开发智能算法以应对人工智能挑战，356页pdf，附代码

专知会员服务

58+阅读 · 2022年12月10日

【人工智能+人力资源】人力资源专业人士的工具箱，Human-Centred Artificial Intelligence for Human Resources: A Toolkit for Human Resources Professionals

【人工智能+人力资源】人力资源专业人士的工具箱，Human-Centred Artificial Intelligence for Human Resources: A Toolkit for Human Resources Professionals

专知会员服务

29+阅读 · 2022年2月17日

终究还是来了，AI卷革程序员！！DeepMind发布媲美普通程序员的AlphaCode

终究还是来了，AI卷革程序员！！DeepMind发布媲美普通程序员的AlphaCode

专知会员服务

27+阅读 · 2022年2月3日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

【新书】人工智能Python代码，227页pdf，Python code for Artificial Intelligence: Foundations of Computational Agents

专知会员服务

103+阅读 · 2020年6月21日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

TensorFlow 2.0 学习资源汇总

TensorFlow 2.0 学习资源汇总

专知会员服务

67+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

视觉-语言-动作模型解析：从模块构成到里程碑与挑战

《解析陆域作战方向：一个概念性框架》报告

【博士论文】基于多模态基础模型的上下文学习

追寻真正的AI自主性：从遗留思维到战场优势

相关资讯

让程序员动嘴写代码，Copilot测试新功能「嘿，GitHub！」

让程序员动嘴写代码，Copilot测试新功能「嘿，GitHub！」

机器之心

0+阅读 · 2022年11月10日

程序员早下班的编码神器 GitHub Copilot，遭 90 亿美元的集体诉讼！

程序员早下班的编码神器 GitHub Copilot，遭 90 亿美元的集体诉讼！

CSDN

1+阅读 · 2022年11月7日

八个不容错过的 GitHub Copilot 功能！

八个不容错过的 GitHub Copilot 功能！

CSDN

11+阅读 · 2022年9月22日

用 20+ 行 JavaScript 代码，短暂“变身” iOS 程序员！

用 20+ 行 JavaScript 代码，短暂“变身” iOS 程序员！

CSDN

0+阅读 · 2022年9月7日

AI帮写代码67元/月！

AI帮写代码67元/月！

夕小瑶的卖萌屋

0+阅读 · 2022年6月27日

CALDERA 一款对手自动模拟工具

CALDERA 一款对手自动模拟工具

黑白之道

20+阅读 · 2019年9月17日

RoBERTa中文预训练模型：RoBERTa for Chinese

RoBERTa中文预训练模型：RoBERTa for Chinese

PaperWeekly

57+阅读 · 2019年9月16日

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

RoBERTa for Chinese：大规模中文预训练RoBERTa模型

AINLP

30+阅读 · 2019年9月8日

PyTorch自然语言处理实战（附详细代码下载）

PyTorch自然语言处理实战（附详细代码下载）

专知

67+阅读 · 2019年2月12日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Is Model Attention Aligned with Human Attention? An Empirical Study on Large Language Models for Code Generation

Arxiv

0+阅读 · 2023年6月2日

A New Algebraic Approach for String Reconstruction from Substring Compositions

Arxiv

0+阅读 · 2023年6月1日

Automatic Emotion Experiencer Recognition

Arxiv

0+阅读 · 2023年6月1日

AI for Low-Code for AI

Arxiv

1+阅读 · 2023年5月31日

Reduced order models for the buckling of hyperelastic beams

Arxiv

0+阅读 · 2023年5月31日

AI for Next Generation Computing: Emerging Trends and Future Directions

Arxiv

19+阅读 · 2022年3月5日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Medical Visual Question Answering: A Survey

Arxiv

15+阅读 · 2021年11月19日

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Beyond Lexical: A Semantic Retrieval Framework for Textual SearchEngine

Arxiv

16+阅读 · 2020年8月10日

Label-aware Double Transfer Learning for Cross-Specialty Medical Named Entity Recognition

Arxiv

10+阅读 · 2018年4月28日

相关基金

无线传感器网络分布式安全时钟同步算法研究

国家自然科学基金

0+阅读 · 2014年12月31日

SR蛋白介导肿瘤发生的功能和机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

化痰通脉饮对PCOS的IRS-1-PI3K/AKT/NF-κB串流失控的调节效应研究

国家自然科学基金

0+阅读 · 2013年12月31日

金融与管理中的HJB方程组的高效有限元方法

国家自然科学基金

0+阅读 · 2013年12月31日

自组装纳米颗粒单层膜／溅射薄膜异质结构的制备与磁光性质的研究

国家自然科学基金

0+阅读 · 2013年12月31日

交互式Petri网及其兼容性研究

国家自然科学基金

0+阅读 · 2012年12月31日

ATM介导自噬分子Beclin1磷酸化修饰的新功能解析

国家自然科学基金

0+阅读 · 2012年12月31日

NUAK1介导LKB1信号通路与PTEN信号通路相互作用的分子机制

国家自然科学基金

0+阅读 · 2012年12月31日

基于iPDMS和SI-ATRP的静电纺丝表面功能化研究

国家自然科学基金

0+阅读 · 2009年12月31日

语言环境下群体共识过程的优化研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员