Deep learning models have been successfully applied to a variety of software engineering tasks, such as code classification, summarisation, and bug and vulnerability detection. In order to apply deep learning to these tasks, source code needs to be represented in a format suitable for input into the deep learning model. Most approaches to representing source code, such as token sequences, abstract syntax trees (ASTs), data flow graphs (DFGs), and control flow graphs (CFGs), focus only on the code itself and do not take into account additional context that could be useful for deep learning models. In this paper, we argue that it is beneficial for deep learning models to have access to additional contextual information about the code being analysed. We present preliminary evidence that encoding context from the call hierarchy along with information from the code itself can improve the performance of a state-of-the-art deep learning model on two software engineering tasks. We outline our research agenda for adding further contextual information to source code representations for deep learning.