Automatic code generation produces program code from a given natural language description. The current mainstream approach uses neural networks to encode the natural language description, output an abstract syntax tree (AST) at the decoder, and then convert the AST into program code. While the generated code largely conforms to specific syntax rules, two problems are still ignored. One is the absence of program testing, an essential step in complete code implementation; the other is the exclusive focus on the syntactic compliance of the generated code, while ignoring the more important functional requirements of the program. This paper proposes a CodeGen-Test model, which adds a program testing step and incorporates program testing information to iteratively generate code that meets the functional requirements of the program, thereby improving the quality of code generation. The paper also proposes a new evaluation metric, test accuracy (Test-Acc), defined as the proportion of generated code that passes its program tests. Unlike previous metrics, which evaluate the quality of generated code only from the perspective of character similarity, Test-Acc evaluates it from the perspective of program functionality. The paper evaluates the CodeGen-Test model on a Python dataset, "hearthstone legend". The experimental results show the proposed method effectively improves the quality of generated code: compared with the existing optimal model, the CodeGen-Test model improves the BLEU score by 0.2%, the ROUGE-L score by 0.3%, and Test-Acc by 6%.
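To make the iterative generate-test-regenerate idea concrete, the following is a minimal sketch of such a loop, not the paper's actual architecture: `generate_code` and `run_tests` are hypothetical stand-ins for the neural decoder and the program testing step, and the feedback-conditioning interface is an assumption.

```python
# Hedged sketch of an iterative code-generation loop that feeds program
# testing information back into the generator. `generate_code` and
# `run_tests` are illustrative placeholders, not the paper's interfaces.
def codegen_test_loop(description, generate_code, run_tests, max_rounds=3):
    """Regenerate code up to max_rounds times, conditioning on test feedback."""
    feedback = None
    code = ""
    for _ in range(max_rounds):
        code = generate_code(description, feedback)  # condition on prior test info
        passed, feedback = run_tests(code)           # feedback, e.g. error messages
        if passed:
            break  # code already meets the functional requirements
    return code
```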
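Since Test-Acc is defined as the proportion of generated programs that pass their tests, it can be computed roughly as sketched below. This assumes each generated sample is paired with runnable Python test code that fails via a non-zero exit status; all names here are illustrative.

```python
# Hedged sketch of the Test-Acc metric: the fraction of generated programs
# whose paired tests run to completion without error. Assumes tests signal
# failure with assertions/exceptions (non-zero exit code).
import os
import subprocess
import tempfile

def test_acc(generated_programs, test_suites, timeout=5):
    """Return the proportion of generated programs that pass their tests."""
    passed = 0
    for program, tests in zip(generated_programs, test_suites):
        with tempfile.NamedTemporaryFile("w", suffix=".py", delete=False) as f:
            f.write(program + "\n" + tests)  # run the program with its tests
            path = f.name
        try:
            result = subprocess.run(
                ["python", path], capture_output=True, timeout=timeout
            )
            if result.returncode == 0:
                passed += 1
        except subprocess.TimeoutExpired:
            pass  # non-terminating code counts as a failed test
        finally:
            os.unlink(path)
    return passed / len(generated_programs)
```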