Program representation learning is a fundamental task in software engineering applications. With the availability of "big code" and the development of deep learning techniques, various program representation learning models have been proposed to capture the semantic properties of programs and applied to different software engineering tasks. However, no previous study has comprehensively assessed how well these deep models generalize across tasks, so the pros and cons of the models remain unclear. In this experience paper, we bridge this gap by systematically evaluating the performance of eight program representation learning models on three common tasks; six of the models are based on abstract syntax trees (ASTs) and two are based on the plain text of source code. We explain the criteria for selecting the models and tasks, as well as the method for enabling end-to-end learning in each task. The results show that the models perform inconsistently across the tasks and that the performance of the AST-based models is generally unstable from task to task. To further explain the results, we apply a prediction attribution technique to identify which input elements the models capture and rely on for their predictions in each task. Based on the findings, we discuss some general principles for better capturing the information in source code, and we hope to inspire researchers to improve program representation learning methods for software engineering tasks.
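
To make the attribution idea concrete, below is a minimal sketch (not necessarily the paper's exact technique) of occlusion-based prediction attribution: mask each token of a snippet and measure the drop in the model's predicted probability. The `model_prob` function here is a hypothetical stand-in for any trained classifier over token sequences.

```python
# Occlusion-based attribution sketch: a token's importance is the drop in
# predicted probability when that token is masked out.

def model_prob(tokens):
    # Hypothetical toy "model": scores how strongly the snippet looks like
    # a sorting routine, only so the sketch runs end to end.
    keywords = {"for", "if", "swap", "range"}
    return sum(t in keywords for t in tokens) / max(len(tokens), 1)

def occlusion_attribution(tokens, mask="<unk>"):
    base = model_prob(tokens)
    scores = []
    for i in range(len(tokens)):
        occluded = tokens[:i] + [mask] + tokens[i + 1:]
        scores.append((tokens[i], base - model_prob(occluded)))
    return scores  # higher score = token mattered more for the prediction

code = "for i in range ( n ) : if a [ i ] > a [ j ] : swap".split()
for token, score in sorted(occlusion_attribution(code), key=lambda s: -s[1])[:5]:
    print(f"{token:>6}  {score:+.3f}")
```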

Related Content

Despite being able to capture a range of features of the data, high-accuracy models trained with supervision tend to make similar predictions. This seemingly implies that high-performing models share similar biases regardless of training methodology, which would limit ensembling benefits and render low-accuracy models as having little practical use. Against this backdrop, recent work has made very different training techniques, such as large-scale contrastive learning, yield competitively high accuracy on generalization and robustness benchmarks. This motivates us to revisit the assumption that models necessarily learn similar functions. We conduct a large-scale empirical study of models across hyper-parameters, architectures, frameworks, and datasets. We find that model pairs that diverge more in training methodology display categorically different generalization behavior, producing increasingly uncorrelated errors. We show these models specialize in subdomains of the data, leading to higher ensemble performance: with just two models (each with ImageNet accuracy ~76.5%), we can create an ensemble reaching 83.4% (a +7% boost). Surprisingly, we find that even significantly lower-accuracy models can be used to improve high-accuracy models. Finally, we show that diverging training methodologies yield representations that capture overlapping (but not supersetting) feature sets which, when combined, lead to increased downstream performance.
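
The ensembling effect can be illustrated with a small sketch: average the class probabilities of two models and compare the combined accuracy against each member's. The simulated "models" below are placeholders for, say, a supervised network and a contrastively trained one; all numbers are synthetic.

```python
# Toy demonstration: two models with uncorrelated errors ensemble to a
# higher accuracy than either member.
import numpy as np

rng = np.random.default_rng(0)
n, k = 1000, 10
labels = rng.integers(0, k, n)

def noisy_probs(correct_rate):
    # Simulate a model that is right `correct_rate` of the time, with its
    # mistakes spread randomly over the classes.
    probs = rng.dirichlet(np.ones(k), n)
    hit = rng.random(n) < correct_rate
    probs[hit, labels[hit]] += 1.0        # boost true class where "correct"
    return probs / probs.sum(1, keepdims=True)

a, b = noisy_probs(0.765), noisy_probs(0.765)
acc = lambda p: (p.argmax(1) == labels).mean()
print("model A:", acc(a), "model B:", acc(b))
print("ensemble (mean of probs):", acc((a + b) / 2))
# With uncorrelated errors, the ensemble exceeds either member, which is
# the effect the study measures on real models.
```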

Model efficiency is a critical aspect of developing and deploying machine learning models. Inference time and latency directly affect the user experience, and some applications have hard latency requirements. In addition to inference costs, model training also has direct financial and environmental impacts. Although there are numerous well-established metrics (cost indicators) for measuring model efficiency, researchers and practitioners often assume that these metrics are correlated with each other and report only a few of them. In this paper, we thoroughly discuss common cost indicators, their advantages and disadvantages, and how they can contradict each other. We demonstrate how incomplete reporting of cost indicators can lead to partial conclusions and a blurred or incomplete picture of the practical considerations of different models. We further present suggestions to improve the reporting of efficiency metrics.
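
As a hedged illustration of why one indicator is not enough, the sketch below builds two toy PyTorch models with nearly identical parameter counts (and similar FLOPs) but potentially very different wall-clock latency; reporting only one of the metrics would hide the difference.

```python
# Two cost indicators that need not agree: parameter count vs. latency.
import time
import torch
import torch.nn as nn

# Roughly the same parameter budget: one wide-and-shallow, one narrow-and-deep.
wide = nn.Sequential(nn.Linear(512, 4096), nn.ReLU(), nn.Linear(4096, 512))
layers = []
for _ in range(16):
    layers += [nn.Linear(512, 512), nn.ReLU()]
deep = nn.Sequential(*layers)

def params(m):
    return sum(p.numel() for p in m.parameters())

def latency(m, runs=50):
    x = torch.randn(32, 512)
    with torch.no_grad():
        m(x)  # warm-up run, excluded from timing
        start = time.perf_counter()
        for _ in range(runs):
            m(x)
    return (time.perf_counter() - start) / runs

for name, m in [("wide", wide), ("deep", deep)]:
    print(f"{name}: {params(m) / 1e6:.2f}M params, {latency(m) * 1e3:.2f} ms/batch")
```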

The goal of text ranking is to generate an ordered list of texts retrieved from a corpus in response to a query. Although the most common formulation of text ranking is search, instances of the task can also be found in many natural language processing applications. This survey provides an overview of text ranking with neural network architectures known as transformers, of which BERT is the best-known example. The combination of transformers and self-supervised pretraining has, without exaggeration, revolutionized the fields of natural language processing (NLP), information retrieval (IR), and beyond. In this survey, we provide a synthesis of existing work as a single point of entry for practitioners who wish to gain a better understanding of how to apply transformers to text ranking problems and researchers who wish to pursue work in this area. We cover a wide range of modern techniques, grouped into two high-level categories: transformer models that perform reranking in multi-stage ranking architectures and learned dense representations that attempt to perform ranking directly. There are two themes that pervade our survey: techniques for handling long documents, beyond the typical sentence-by-sentence processing approaches used in NLP, and techniques for addressing the tradeoff between effectiveness (result quality) and efficiency (query latency). Although transformer architectures and pretraining techniques are recent innovations, many aspects of how they are applied to text ranking are relatively well understood and represent mature techniques. However, there remain many open research questions, and thus in addition to laying out the foundations of pretrained transformers for text ranking, this survey also attempts to prognosticate where the field is heading.
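
As a small, hedged sketch of the first of those two categories (reranking), the snippet below scores candidate passages against a query with a publicly available cross-encoder checkpoint via the Hugging Face transformers library; any equivalent checkpoint would do.

```python
# Cross-encoder reranking: a first-stage retriever returns candidates, and a
# transformer rescores each (query, passage) pair jointly.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

name = "cross-encoder/ms-marco-MiniLM-L-6-v2"  # public MS MARCO cross-encoder
tok = AutoTokenizer.from_pretrained(name)
model = AutoModelForSequenceClassification.from_pretrained(name).eval()

query = "what causes memory leaks in python"
candidates = [
    "Reference cycles that the garbage collector cannot free cause leaks.",
    "Python was first released in 1991 by Guido van Rossum.",
]

inputs = tok([query] * len(candidates), candidates,
             padding=True, truncation=True, return_tensors="pt")
with torch.no_grad():
    scores = model(**inputs).logits.squeeze(-1)  # higher = more relevant
for s, c in sorted(zip(scores.tolist(), candidates), reverse=True):
    print(f"{s:+.2f}  {c}")
```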

A code completion system suggests future code elements to developers given a partially complete code snippet. Code completion is one of the most useful features in Integrated Development Environments (IDEs). Currently, most code completion techniques predict a single token at a time. In this paper, we take a further step and discuss the possibility of directly completing a whole line of code instead of a single token. We believe suggesting longer code sequences can further improve the efficiency of developers. Recently, neural language models have been adopted as a preferred approach for code completion, and we believe these models can still be applied to full-line code completion with a few improvements. We conduct our experiments on two real-world Python corpora and evaluate existing neural models based on source code tokens or syntactical actions. The results show that neural language models can achieve acceptable results on our tasks, with significant room for improvement.
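
One straightforward way to turn a next-token model into a full-line completer is to decode greedily until a newline token or a length budget is reached. The sketch below shows that loop; `next_token_probs` is a hypothetical toy model standing in for a trained neural language model.

```python
# Full-line completion as repeated next-token prediction.

def next_token_probs(prefix):
    # Toy bigram-style model over a tiny vocabulary, only to make the
    # decoding loop concrete and runnable.
    table = {
        "for": {"i": 0.9, "\n": 0.1},
        "i": {"in": 0.8, "=": 0.2},
        "in": {"range(n):": 1.0},
        "range(n):": {"\n": 1.0},
    }
    return table.get(prefix[-1], {"\n": 1.0})

def complete_line(prefix, max_tokens=10):
    tokens = list(prefix)
    for _ in range(max_tokens):
        probs = next_token_probs(tokens)
        token = max(probs, key=probs.get)  # greedy decoding
        if token == "\n":                  # stop at end of line
            break
        tokens.append(token)
    return " ".join(tokens)

print(complete_line(["for"]))  # -> "for i in range(n):"
```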

There has been appreciable progress in unsupervised network representation learning (UNRL) approaches over graphs recently, with flexible random-walk approaches, new optimization objectives, and deep architectures. However, there is no common ground for systematic comparison of embeddings to understand their behavior for different graphs and tasks. In this paper, we theoretically group different approaches under a unifying framework and empirically investigate the effectiveness of different network representation methods. In particular, we argue that most UNRL approaches either explicitly or implicitly model and exploit the context information of a node. Consequently, we propose a framework that casts a variety of approaches -- random walk based, matrix factorization based, and deep learning based -- into a unified context-based optimization function. We systematically group the methods based on their similarities and differences, and we study the differences among these methods in detail, which we later use to explain their performance differences on downstream tasks. We conduct a large-scale empirical study considering 9 popular and recent UNRL techniques and 11 real-world datasets with varying structural properties, on two common tasks -- node classification and link prediction. We find that there is no single method that is a clear winner and that the choice of a suitable method is dictated by certain properties of the embedding methods, the task, and the structural properties of the underlying graph. In addition, we report common pitfalls in the evaluation of UNRL methods and offer suggestions for experimental design and interpretation of results.
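
The unified context-based view can be sketched in a few lines: sample (node, context) pairs from random walks and fit embeddings so that co-occurring nodes get high dot products, here with a context window of one and a single negative sample per pair. Real methods differ in exactly these choices; this is only an illustrative toy.

```python
# Context-based embedding sketch: skip-gram-style updates over random walks.
import numpy as np

rng = np.random.default_rng(0)
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2, 4], 4: [3]}
emb = rng.normal(scale=0.1, size=(5, 8))  # one 8-d vector per node

def random_walk(start, length=5):
    walk = [start]
    for _ in range(length - 1):
        walk.append(rng.choice(adj[walk[-1]]))
    return walk

sigmoid = lambda x: 1 / (1 + np.exp(-x))
for _ in range(2000):
    walk = random_walk(rng.integers(5))
    for u, v in zip(walk, walk[1:]):      # window size 1 for brevity
        neg = rng.integers(5)             # one random negative sample
        for tgt, label in ((v, 1.0), (neg, 0.0)):
            g = sigmoid(emb[u] @ emb[tgt]) - label
            emb[u] -= 0.05 * g * emb[tgt]
            emb[tgt] -= 0.05 * g * emb[u]

# Nodes adjacent in the graph should now have larger dot products than
# distant ones.
print((emb[0] @ emb[1]).round(2), (emb[0] @ emb[4]).round(2))
```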

There has been considerable growth and interest in industrial applications of machine learning (ML) in recent years. ML engineers are consequently in high demand across the industry, yet improving their efficiency remains a fundamental challenge. Automated machine learning (AutoML) has emerged as a way to save time and effort on repetitive tasks in ML pipelines, such as data pre-processing, feature engineering, model selection, hyperparameter optimization, and prediction result analysis. In this paper, we investigate the current state of AutoML tools aiming to automate these tasks. We evaluate the tools on many datasets and in different data segments to examine their performance, and we compare their advantages and disadvantages across different test cases.
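
A hedged sketch of such an evaluation harness appears below: each candidate pipeline is run on several datasets under the same cross-validation protocol. In a real study, the fixed scikit-learn pipelines would be replaced by the AutoML tools' own search over preprocessing, models, and hyperparameters.

```python
# Minimal tool-comparison harness: same datasets, same CV protocol.
from sklearn.datasets import load_breast_cancer, load_wine
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

candidates = {
    "logreg": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
    "forest": RandomForestClassifier(n_estimators=200, random_state=0),
}
datasets = {"breast_cancer": load_breast_cancer(return_X_y=True),
            "wine": load_wine(return_X_y=True)}

for dname, (X, y) in datasets.items():
    for cname, model in candidates.items():
        scores = cross_val_score(model, X, y, cv=5)
        print(f"{dname:>13} | {cname}: {scores.mean():.3f} +/- {scores.std():.3f}")
```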

Deep learning has penetrated all aspects of our lives and brought us great convenience. However, the process of building a high-quality deep learning system for a specific task is not only time-consuming but also requires many resources and relies on human expertise, which hinders the development of deep learning in both industry and academia. To alleviate this problem, a growing number of research projects focus on automated machine learning (AutoML). In this paper, we provide a comprehensive and up-to-date study of the state of the art in AutoML. First, we introduce AutoML techniques in detail, organized according to the machine learning pipeline. Then we summarize existing Neural Architecture Search (NAS) research, one of the most popular topics in AutoML, and compare the models generated by NAS algorithms with human-designed models. Finally, we present several open problems for future research.
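
To make the NAS idea concrete, below is a toy sketch of its simplest baseline, random search: sample architectures from a small search space, train each briefly, and keep the best. Published NAS methods replace this loop with reinforcement learning, evolution, or gradient-based search.

```python
# Random-search NAS baseline over a tiny MLP search space.
import random

from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

X, y = load_digits(return_X_y=True)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

random.seed(0)
best_score, best_arch = 0.0, None
for _ in range(5):
    # Search space: depth 1-3, width per layer in {32, 64, 128}.
    arch = tuple(random.choice([32, 64, 128])
                 for _ in range(random.randint(1, 3)))
    net = MLPClassifier(hidden_layer_sizes=arch, max_iter=300, random_state=0)
    net.fit(X_tr, y_tr)
    score = net.score(X_val, y_val)
    if score > best_score:
        best_score, best_arch = score, arch
    print(arch, f"{score:.3f}")
print("best architecture:", best_arch, f"(val acc {best_score:.3f})")
```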

Previous work on event extraction has mainly focused on predicting event triggers and argument roles, treating entity mentions as being provided by human annotators. This is unrealistic, as entity mentions are usually predicted by existing toolkits whose errors can propagate to event trigger and argument role recognition. Some recent work has addressed this problem by jointly predicting entity mentions, event triggers, and arguments. However, such work is limited to using discrete engineering features to represent contextual information for the individual tasks and their interactions. In this work, we propose a novel model to jointly perform predictions for entity mentions, event triggers, and arguments based on shared hidden representations from deep learning. The experiments demonstrate the benefits of the proposed method, leading to state-of-the-art performance for event extraction.
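
A structural sketch of this idea (not the paper's exact architecture) is shown below: a single shared encoder feeds separate heads for entity mentions, event triggers, and argument roles, so that one summed loss over all three tasks lets them inform each other. All sizes are illustrative.

```python
# Joint prediction over shared hidden representations.
import torch
import torch.nn as nn

class JointEventModel(nn.Module):
    def __init__(self, vocab=5000, dim=128, n_ent=5, n_trig=9, n_arg=7):
        super().__init__()
        self.embed = nn.Embedding(vocab, dim)
        self.encoder = nn.GRU(dim, dim, batch_first=True, bidirectional=True)
        self.entity_head = nn.Linear(2 * dim, n_ent)    # per-token entity tags
        self.trigger_head = nn.Linear(2 * dim, n_trig)  # per-token trigger tags
        self.argument_head = nn.Linear(4 * dim, n_arg)  # trigger-argument pairs

    def forward(self, tokens):
        h, _ = self.encoder(self.embed(tokens))         # shared representation
        ent = self.entity_head(h)
        trig = self.trigger_head(h)
        # Score every (trigger position, argument position) pair.
        n = h.size(1)
        pairs = torch.cat([h.unsqueeze(2).expand(-1, -1, n, -1),
                           h.unsqueeze(1).expand(-1, n, -1, -1)], dim=-1)
        return ent, trig, self.argument_head(pairs)

model = JointEventModel()
ent, trig, arg = model(torch.randint(0, 5000, (2, 12)))
print(ent.shape, trig.shape, arg.shape)  # (2,12,5) (2,12,9) (2,12,12,7)
```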

Neural word embeddings have been widely used in biomedical Natural Language Processing (NLP) applications, since they provide vector representations of words that capture their semantic properties and the linguistic relationships between them. Many biomedical applications train word embeddings on different textual sources and apply these embeddings to downstream tasks. However, there has been little work on comprehensively evaluating the word embeddings trained from these resources. In this study, we provide a comprehensive empirical evaluation of word embeddings trained from four different resources: clinical notes, biomedical publications, Wikipedia, and news. We perform the evaluation both qualitatively and quantitatively. In the qualitative evaluation, we manually inspect the five most similar medical words for a given set of target medical words, and we then analyze the word embeddings through visualization. The quantitative evaluation falls into two categories: extrinsic and intrinsic. Based on the evaluation results, we draw the following conclusions. First, the embeddings trained on EHR and PubMed capture the semantics of medical terms better than GloVe and Google News embeddings and find more relevant similar medical terms. Second, the medical semantic similarity captured by the embeddings trained on EHR and PubMed is closer to human experts' judgments than that of the GloVe and Google News embeddings. Third, there is no consistent global ranking of word embedding quality across downstream biomedical NLP applications; however, adding word embeddings as extra features improves results on most downstream tasks. Finally, word embeddings trained on a corpus from a similar domain do not necessarily perform better than other word embeddings on any given downstream biomedical task.
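
The qualitative part of such an evaluation reduces to a nearest-neighbor query, sketched below with tiny hand-made vectors standing in for embeddings trained on EHR, PubMed, GloVe, or Google News corpora.

```python
# Nearest neighbors by cosine similarity, the core of the manual inspection.
import numpy as np

vocab = ["diabetes", "insulin", "hyperglycemia", "aspirin", "guitar"]
emb = np.array([[0.9, 0.8, 0.1],   # toy 3-d vectors; real ones are 100-300d
                [0.8, 0.9, 0.0],
                [0.85, 0.7, 0.2],
                [0.3, 0.4, 0.1],
                [0.0, 0.1, 0.9]])

def most_similar(word, topn=3):
    q = emb[vocab.index(word)]
    sims = emb @ q / (np.linalg.norm(emb, axis=1) * np.linalg.norm(q))
    order = [i for i in np.argsort(-sims) if vocab[i] != word]
    return [(vocab[i], round(float(sims[i]), 3)) for i in order[:topn]]

print(most_similar("diabetes"))
# A domain-appropriate embedding should rank insulin/hyperglycemia above
# unrelated terms, which is what the manual inspection in the study checks.
```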

Related Papers

No One Representation to Rule Them All: Overlapping Features of Training Methods
Raphael Gontijo-Lopes, Yann Dauphin, Ekin D. Cubuk · Oct 26

The Efficiency Misnomer
Mostafa Dehghani, Anurag Arnab, Lucas Beyer, Ashish Vaswani, Yi Tay · Oct 25

Pretrained Transformers for Text Ranking: BERT and Beyond
Jimmy Lin, Rodrigo Nogueira, Andrew Yates · Oct 13, 2020

Towards Full-line Code Completion with Neural Language Models
Wenhan Wang, Sijie Shen, Ge Li, Zhi Jin · Sep 18, 2020

A Comparative Study for Unsupervised Network Representation Learning
Megha Khosla, Vinay Setty, Avishek Anand · Mar 11, 2020

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools
Anh Truong, Austin Walters, Jeremy Goodsitt, Keegan Hines, C. Bayan Bruss, Reza Farivar · Sep 3, 2019

AutoML: A Survey of the State-of-the-Art
Xin He, Kaiyong Zhao, Xiaowen Chu · Aug 14, 2019

One for All: Neural Joint Modeling of Entities and Events
Trung Minh Nguyen, Thien Huu Nguyen · Dec 1, 2018

A Comparison of Word Embeddings for the Biomedical Natural Language Processing
Yanshan Wang, Sijia Liu, Naveed Afzal, Majid Rastegar-Mojarad, Liwei Wang, Feichen Shen, Hongfang Liu · Feb 1, 2018