基于事件嵌入和事件依赖的代码克隆探测 (Code Clone Detection based on Event Embedding and Event Dependency) - 专知论文

会员服务 ·

0

INFORMS · 相似度 · 语义相似度 · MoDELS · 向量化 ·

2021 年 11 月 28 日

Code Clone Detection based on Event Embedding and Event Dependency

翻译：基于事件嵌入和事件依赖的代码克隆探测

Cheng Huang,Hui Zhou,Chunyang Ye,Bingzhuo Li

The code clone detection method based on semantic similarity has important value in software engineering tasks (e.g., software evolution, software reuse). Traditional code clone detection technologies pay more attention to the similarity of code at the syntax level, and less attention to the semantic similarity of the code. As a result, candidate codes similar in semantics are ignored. To address this issue, we propose a code clone detection method based on semantic similarity. By treating code as a series of interdependent events that occur continuously, we design a model namely EDAM to encode code semantic information based on event embedding and event dependency. The EDAM model uses the event embedding method to model the execution characteristics of program statements and the data dependence information between all statements. In this way, we can embed the program semantic information into a vector and use the vector to detect codes similar in semantics. Experimental results show that the performance of our EDAM model is superior to state of-the-art open source models for code clone detection.

翻译：以语义相似性为基础的代码克隆检测方法在软件工程任务(如软件演化、软件再利用)中具有重要价值; 传统代码克隆检测技术更加注意语法层面代码的相似性,而较少注意代码的语义相似性; 因此,在语义相似性中,候选代码被忽略; 为了解决这一问题, 我们提出了一个基于语义相似性的代码检测方法。通过将代码作为一系列不断发生的相互依存事件处理, 我们设计了一个模型, 即 EDAM 来根据事件嵌入和事件依赖性来编码语义信息。 EDAM 模型使用事件嵌入方法来模拟程序语句的执行特性和所有语句之间数据依赖性信息。这样, 我们可以将程序语义信息嵌入向矢量中, 并使用矢量来检测语义相似的代码。实验结果表明, 我们的EDAM 模型的性能优于用于代码克隆检测的开放源模型。

0

相关内容

INFORMS

《计算机信息》杂志发表高质量的论文，扩大了运筹学和计算的范围，寻求有关理论、方法、实验、系统和应用方面的原创研究论文、新颖的调查和教程论文，以及描述新的和有用的软件工具的论文。官网链接：https://pubsonline.informs.org/journal/ijoc

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

38+阅读 · 2020年7月3日

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

专知会员服务

21+阅读 · 2020年3月28日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【快讯】CVPR2020结果出炉，1470篇上榜，你的paper中了吗？

【快讯】CVPR2020结果出炉，1470篇上榜，你的paper中了吗？

专知会员服务

51+阅读 · 2020年2月24日

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

专知会员服务

79+阅读 · 2020年2月12日

Transformer文本分类代码

Transformer文本分类代码

专知会员服务

118+阅读 · 2020年2月3日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

异常检测（Anomaly Detection）综述

异常检测（Anomaly Detection）综述

极市平台

20+阅读 · 2020年10月24日

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

深度学习自然语言处理

18+阅读 · 2020年5月22日

计算机 | 国际会议信息5条

计算机 | 国际会议信息5条

Call4Papers

3+阅读 · 2019年7月3日

CVPR2019年热门论文及开源代码分享

CVPR2019年热门论文及开源代码分享

深度学习与NLP

7+阅读 · 2019年6月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

【资源推荐】情感分析资源列表

【资源推荐】情感分析资源列表

专知

31+阅读 · 2019年3月20日

综述 | 事件抽取及推理 (上)

综述 | 事件抽取及推理 (上)

开放知识图谱

87+阅读 · 2019年1月9日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

Automatic Audio Captioning using Attention weighted Event based Embeddings

Arxiv

0+阅读 · 2022年1月28日

Time-Series Event Prediction with Evolutionary State Graph

Arxiv

14+阅读 · 2020年11月25日

Few-shot acoustic event detection via meta-learning

Arxiv

26+阅读 · 2020年2月21日

Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection

Arxiv

20+阅读 · 2019年10月25日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Android Malware Detection using Large-scale Network Representation Learning

Arxiv

4+阅读 · 2018年12月11日

Object Relation Detection Based on One-shot Learning

Object Relation Detection Based on One-shot Learning

Arxiv

3+阅读 · 2018年7月16日

Generating Fine-Grained Open Vocabulary Entity Type Descriptions

Arxiv

4+阅读 · 2018年5月27日

Few-Example Object Detection with Model Communication

Arxiv

7+阅读 · 2018年2月14日

An Attention-Based Word-Level Interaction Model: Relation Detection for Knowledge Base Question Answering

Arxiv

6+阅读 · 2018年1月30日

VIP会员

文章信息

相关主题

语义相似度

相关VIP内容

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

【KDD2020】动态图的拉普拉斯变换点检测，Laplacian Change Point Detection for Dynamic Graphs

专知会员服务

38+阅读 · 2020年7月3日

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

【Mila-Google】使用元学习动态调整源代码模型，On-the-Fly Adaptation of Source Code Models using Meta-Learning

专知会员服务

21+阅读 · 2020年3月28日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

抢鲜看！13篇CVPR2020论文链接/开源代码/解读

专知会员服务

50+阅读 · 2020年2月26日

【快讯】CVPR2020结果出炉，1470篇上榜，你的paper中了吗？

【快讯】CVPR2020结果出炉，1470篇上榜，你的paper中了吗？

专知会员服务

51+阅读 · 2020年2月24日

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

【东大-UCSB】虚假新闻检测的自然语言处理研究综述，A Survey on Natural Language Processing for Fake News Detection

专知会员服务

79+阅读 · 2020年2月12日

Transformer文本分类代码

Transformer文本分类代码

专知会员服务

118+阅读 · 2020年2月3日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

《无人机战争时代的战时法：大国竞争中的区分原则、相称性原则与行动建议》最新75页

《构建强健军事力量的设计挑战：提升海军兵力支持系统效能的多分辨率建模方法》69页

正视无人机心理战：恐惧效应与战略反思

《精确反蜂群防御系统：三维运动探测与定向空爆拦截技术融合》最新24页

相关资讯

异常检测（Anomaly Detection）综述

异常检测（Anomaly Detection）综述

极市平台

20+阅读 · 2020年10月24日

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

【ACL2020放榜!】事件抽取、关系抽取、NER、Few-Shot 相关论文整理

深度学习自然语言处理

18+阅读 · 2020年5月22日

计算机 | 国际会议信息5条

计算机 | 国际会议信息5条

Call4Papers

3+阅读 · 2019年7月3日

CVPR2019年热门论文及开源代码分享

CVPR2019年热门论文及开源代码分享

深度学习与NLP

7+阅读 · 2019年6月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

基于PyTorch/TorchText的自然语言处理库

基于PyTorch/TorchText的自然语言处理库

专知

28+阅读 · 2019年4月22日

【资源推荐】情感分析资源列表

【资源推荐】情感分析资源列表

专知

31+阅读 · 2019年3月20日

综述 | 事件抽取及推理 (上)

综述 | 事件抽取及推理 (上)

开放知识图谱

87+阅读 · 2019年1月9日

【推荐】自然语言处理（NLP）指南

【推荐】自然语言处理（NLP）指南

机器学习研究会

35+阅读 · 2017年11月17日

【推荐】MXNet深度情感分析实战

【推荐】MXNet深度情感分析实战

机器学习研究会

16+阅读 · 2017年10月4日

相关论文

Automatic Audio Captioning using Attention weighted Event based Embeddings

Arxiv

0+阅读 · 2022年1月28日

Time-Series Event Prediction with Evolutionary State Graph

Arxiv

14+阅读 · 2020年11月25日

Few-shot acoustic event detection via meta-learning

Arxiv

26+阅读 · 2020年2月21日

Meta-Learning with Dynamic-Memory-Based Prototypical Network for Few-Shot Event Detection

Arxiv

20+阅读 · 2019年10月25日

Reverse Attention for Salient Object Detection

Arxiv

11+阅读 · 2019年4月15日

Android Malware Detection using Large-scale Network Representation Learning

Arxiv

4+阅读 · 2018年12月11日

Object Relation Detection Based on One-shot Learning

Object Relation Detection Based on One-shot Learning

Arxiv

3+阅读 · 2018年7月16日

Generating Fine-Grained Open Vocabulary Entity Type Descriptions

Arxiv

4+阅读 · 2018年5月27日

Few-Example Object Detection with Model Communication

Arxiv

7+阅读 · 2018年2月14日

An Attention-Based Word-Level Interaction Model: Relation Detection for Knowledge Base Question Answering

Arxiv

6+阅读 · 2018年1月30日

微信扫码咨询专知VIP会员