MISIM: 使用上下文软件语义结构的神经编码语义相似系统 (MISIM: A Neural Code Semantics Similarity System Using the Context-Aware Semantics Structure) - 专知论文

会员服务 ·

0

语义相似度 · 相似度 · Extensibility · 模型评估 · Automator ·

2021 年 6 月 2 日

MISIM: A Neural Code Semantics Similarity System Using the Context-Aware Semantics Structure

翻译：MISIM: 使用上下文软件语义结构的神经编码语义相似系统

Fangke Ye,Shengtian Zhou,Anand Venkat,Ryan Marcus,Nesime Tatbul,Jesmin Jahan Tithi,Niranjan Hasabnis,Paul Petersen,Timothy Mattson,Tim Kraska,Pradeep Dubey,Vivek Sarkar,Justin Gottschlich

from arxiv, arXiv admin note: text overlap with arXiv:2003.11118

Code semantics similarity can be used for many tasks such as code recommendation, automated software defect correction, and clone detection. Yet, the accuracy of such systems has not yet reached a level of general purpose reliability. To help address this, we present Machine Inferred Code Similarity (MISIM), a neural code semantics similarity system consisting of two core components: (i)MISIM uses a novel context-aware semantics structure, which was purpose-built to lift semantics from code syntax; (ii)MISIM uses an extensible neural code similarity scoring algorithm, which can be used for various neural network architectures with learned parameters. We compare MISIM to four state-of-the-art systems, including two additional hand-customized models, over 328K programs consisting of over 18 million lines of code. Our experiments show that MISIM has 8.08% better accuracy (using MAP@R) compared to the next best performing system.

翻译：代码语义相似性可用于许多任务,例如代码建议、自动软件缺陷校正和克隆检测。然而,这些系统的准确性尚未达到一般目的可靠性的水平。为了解决这一问题,我们提出一个神经代码语义相似性系统,由两个核心部分组成:(i) MISIM使用一种新的符合背景的语义结构,目的是将语义从代码语义中去除;(ii) MISIM使用一种可扩展的神经代码相似性评分算法,可用于各种具有学习参数的神经网络结构。我们将MISIM比作四个最先进的系统,包括另外两个手定制的模型,超过由1 800万行代码组成的328K程序。我们的实验表明,MISIM比下一个最佳运行系统精准8.08%(使用MAP@R)。

0

相关内容

语义相似度

语义相似度

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

SIGIR2021接受论文列表公布！151篇论文都在这了！

专知会员服务

38+阅读 · 2021年4月27日

【斯坦福大学】矩阵对策的协调方法，89页pdf

【斯坦福大学】矩阵对策的协调方法，89页pdf

专知会员服务

27+阅读 · 2020年9月18日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

LibRec 精选：CCF TPCI 的推荐系统专刊征稿

LibRec 精选：CCF TPCI 的推荐系统专刊征稿

LibRec智能推荐

4+阅读 · 2019年1月12日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

推荐｜深度强化学习聊天机器人（附论文）！

推荐｜深度强化学习聊天机器人（附论文）！

全球人工智能

4+阅读 · 2018年1月30日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

LibRec 每周精选：10篇每个人都应该读的RecSys文章

LibRec 每周精选：10篇每个人都应该读的RecSys文章

LibRec智能推荐

5+阅读 · 2018年1月1日

【推荐】SLAM相关资源大列表

【推荐】SLAM相关资源大列表

机器学习研究会

10+阅读 · 2017年8月18日

Advanced Semantics for Commonsense Knowledge Extraction

Arxiv

6+阅读 · 2021年2月12日

Evaluating Multimodal Representations on Visual Semantic Textual Similarity

Evaluating Multimodal Representations on Visual Semantic Textual Similarity

Arxiv

6+阅读 · 2020年4月4日

Semantics-aware BERT for Language Understanding

Arxiv

4+阅读 · 2019年9月5日

NAIS: Neural Attentive Item Similarity Model for Recommendation

Arxiv

3+阅读 · 2018年9月19日

Structure Aware SLAM using Quadrics and Planes

Structure Aware SLAM using Quadrics and Planes

Arxiv

4+阅读 · 2018年8月13日

Semantic Parsing: Syntactic assurance to target sentence using LSTM Encoder CFG-Decoder

Semantic Parsing: Syntactic assurance to target sentence using LSTM Encoder CFG-Decoder

Arxiv

4+阅读 · 2018年7月18日

Neural Network Models for Paraphrase Identification, Semantic Textual Similarity, Natural Language Inference, and Question Answering

Arxiv

7+阅读 · 2018年6月12日

Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval

Arxiv

7+阅读 · 2018年6月3日

SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text

Arxiv

5+阅读 · 2018年5月18日

Convolutional CRFs for Semantic Segmentation

Arxiv

8+阅读 · 2018年5月15日

VIP会员

文章信息

相关主题

语义相似度

相关VIP内容

“CVPR 2021 接受论文列表 1663篇论文都在这了

专知会员服务

32+阅读 · 2021年6月12日

SIGIR2021接受论文列表公布！151篇论文都在这了！

专知会员服务

38+阅读 · 2021年4月27日

【斯坦福大学】矩阵对策的协调方法，89页pdf

【斯坦福大学】矩阵对策的协调方法，89页pdf

专知会员服务

27+阅读 · 2020年9月18日

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

语义相似性算法演化论文，29页pdf，Evolution of Semantic Similarity - A Survey

专知会员服务

44+阅读 · 2020年4月30日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

相关资讯

LibRec 精选：AutoML for Contextual Bandits

LibRec 精选：AutoML for Contextual Bandits

LibRec智能推荐

7+阅读 · 2019年9月19日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

LibRec 精选：CCF TPCI 的推荐系统专刊征稿

LibRec 精选：CCF TPCI 的推荐系统专刊征稿

LibRec智能推荐

4+阅读 · 2019年1月12日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

Hierarchical Imitation - Reinforcement Learning

Hierarchical Imitation - Reinforcement Learning

CreateAMind

19+阅读 · 2018年5月25日

推荐｜深度强化学习聊天机器人（附论文）！

推荐｜深度强化学习聊天机器人（附论文）！

全球人工智能

4+阅读 · 2018年1月30日

carla无人驾驶模拟中文项目 carla_simulator_Chinese

carla无人驾驶模拟中文项目 carla_simulator_Chinese

CreateAMind

3+阅读 · 2018年1月30日

LibRec 每周精选：10篇每个人都应该读的RecSys文章

LibRec 每周精选：10篇每个人都应该读的RecSys文章

LibRec智能推荐

5+阅读 · 2018年1月1日

【推荐】SLAM相关资源大列表

【推荐】SLAM相关资源大列表

机器学习研究会

10+阅读 · 2017年8月18日

相关论文

Advanced Semantics for Commonsense Knowledge Extraction

Arxiv

6+阅读 · 2021年2月12日

Evaluating Multimodal Representations on Visual Semantic Textual Similarity

Evaluating Multimodal Representations on Visual Semantic Textual Similarity

Arxiv

6+阅读 · 2020年4月4日

Semantics-aware BERT for Language Understanding

Arxiv

4+阅读 · 2019年9月5日

NAIS: Neural Attentive Item Similarity Model for Recommendation

Arxiv

3+阅读 · 2018年9月19日

Structure Aware SLAM using Quadrics and Planes

Structure Aware SLAM using Quadrics and Planes

Arxiv

4+阅读 · 2018年8月13日

Semantic Parsing: Syntactic assurance to target sentence using LSTM Encoder CFG-Decoder

Semantic Parsing: Syntactic assurance to target sentence using LSTM Encoder CFG-Decoder

Arxiv

4+阅读 · 2018年7月18日

Neural Network Models for Paraphrase Identification, Semantic Textual Similarity, Natural Language Inference, and Question Answering

Arxiv

7+阅读 · 2018年6月12日

Entity-Duet Neural Ranking: Understanding the Role of Knowledge Graph Semantics in Neural Information Retrieval

Arxiv

7+阅读 · 2018年6月3日

SemStyle: Learning to Generate Stylised Image Captions using Unaligned Text

Arxiv

5+阅读 · 2018年5月18日

Convolutional CRFs for Semantic Segmentation

Arxiv

8+阅读 · 2018年5月15日

微信扫码咨询专知VIP会员