KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair - Zhuanzhi (专知) Papers


Domain knowledge · Decoding · Bug

April 16, 2023

KNOD: Domain Knowledge Distilled Tree Decoder for Automated Program Repair


Nan Jiang, Thibaud Lutellier, Yiling Lou, Lin Tan, Dan Goldwasser, Xiangyu Zhang

From arXiv. This paper was accepted by the 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE).

Automated Program Repair (APR) improves software reliability by generating patches for a buggy program automatically. Recent APR techniques leverage deep learning (DL) to build models to learn to generate patches from existing patches and code corpora. While promising, DL-based APR techniques suffer from the abundant syntactically or semantically incorrect patches in the patch space. These patches often disobey the syntactic and semantic domain knowledge of source code and thus cannot be the correct patches to fix a bug. We propose a DL-based APR approach KNOD, which incorporates domain knowledge to guide patch generation in a direct and comprehensive way. KNOD has two major novelties, including (1) a novel three-stage tree decoder, which directly generates Abstract Syntax Trees of patched code according to the inherent tree structure, and (2) a novel domain-rule distillation, which leverages syntactic and semantic rules and teacher-student distributions to explicitly inject the domain knowledge into the decoding procedure during both the training and inference phases. We evaluate KNOD on three widely-used benchmarks. KNOD fixes 72 bugs on the Defects4J v1.2, 25 bugs on the QuixBugs, and 50 bugs on the additional Defects4J v2.0 benchmarks, outperforming all existing APR tools.
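The abstract does not give implementation details, so the following is only a minimal sketch of the general idea behind rule-guided distillation, not KNOD's actual code. It assumes a hypothetical vocabulary of AST node types (`NODE_TYPES`) and a hypothetical grammar rule table (`ALLOWED_CHILDREN`). The teacher distribution is built by zeroing out the candidates a rule forbids and renormalizing the student's probabilities; the KL divergence from teacher to student could then serve as a training signal, and the teacher distribution could be used to pick a rule-conforming node at inference time.

```python
import math

# Hypothetical vocabulary of AST node types (illustrative only).
NODE_TYPES = ["IfStatement", "ReturnStatement", "BinaryExpr", "MethodCall"]

# Hypothetical syntactic rule: a Block may only contain statements.
ALLOWED_CHILDREN = {"Block": {"IfStatement", "ReturnStatement"}}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def teacher_from_rules(student_probs, allowed):
    """Zero out rule-violating candidates and renormalize the rest."""
    masked = [p if ok else 0.0 for p, ok in zip(student_probs, allowed)]
    total = sum(masked)
    if total == 0.0:
        raise ValueError("the rule forbids every candidate")
    return [p / total for p in masked]


def kl_divergence(teacher, student):
    """KL(teacher || student), skipping zero-probability teacher entries."""
    return sum(t * math.log(t / s) for t, s in zip(teacher, student) if t > 0.0)


def distill_step(logits, parent_type):
    """Return the teacher distribution, the distillation loss, and the
    rule-conforming node type with the highest teacher probability."""
    allowed = [t in ALLOWED_CHILDREN[parent_type] for t in NODE_TYPES]
    student = softmax(logits)
    teacher = teacher_from_rules(student, allowed)
    loss = kl_divergence(teacher, student)
    best_index = max(range(len(NODE_TYPES)), key=lambda i: teacher[i])
    return teacher, loss, NODE_TYPES[best_index]


if __name__ == "__main__":
    # The raw logits favor BinaryExpr, which the Block rule forbids.
    teacher, loss, best = distill_step([1.0, 0.5, 3.0, 0.0], "Block")
    print(best, round(loss, 4))
```

In this toy setting the student initially prefers a node type that the rule forbids, so the loss is positive and shrinks as the student shifts probability mass onto rule-conforming candidates.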

