In recent years, there has been wide interest in designing deep neural network-based models that automate downstream software engineering tasks, such as program document generation, code search, and program repair. Although the main objective of these studies is to improve the effectiveness of the downstream task, many studies simply employ the next best neural network model, without an in-depth analysis of why a particular solution works or fails on particular tasks or scenarios. In this paper, using an eXplainable AI (XAI) method (the attention mechanism), we study state-of-the-art Transformer-based models (CodeBERT and GraphCodeBERT) on a set of software engineering downstream tasks: code document generation (CDG), code refinement (CR), and code translation (CT). We first evaluate the validity of the attention mechanism on each particular task. Then, through quantitative and qualitative studies, we identify what CodeBERT and GraphCodeBERT learn on these tasks, i.e., which source code token types they put the highest attention on. Finally, we show some of the common patterns when the models do not work as expected (perform poorly even though the problem at hand is easy) and suggest recommendations that may alleviate the observed challenges.
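To make the XAI setup concrete, the following is a minimal sketch of how attention can be aggregated over source-code token types. All names here (`attention_by_token_type`, the toy token-type labels) are illustrative assumptions, not part of CodeBERT or GraphCodeBERT; a real analysis would use the attention tensors returned by the actual models.

```python
import numpy as np

def attention_by_token_type(attn, token_types):
    """Aggregate the attention each token receives, grouped by token type.

    attn: array of shape (heads, seq_len, seq_len), rows are softmax-normalized.
    token_types: one type label (e.g. "identifier") per token position.
    """
    # Average over heads, then sum the attention flowing into each column,
    # i.e., how much all queries attend to that token.
    received = attn.mean(axis=0).sum(axis=0)  # shape: (seq_len,)
    totals = {}
    for t, score in zip(token_types, received):
        totals[t] = totals.get(t, 0.0) + float(score)
    # Normalize so the scores over token types sum to 1.
    norm = sum(totals.values())
    return {t: s / norm for t, s in totals.items()}

# Toy example: 2 heads, 4 tokens with hypothetical token-type labels.
rng = np.random.default_rng(0)
logits = rng.normal(size=(2, 4, 4))
attn = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)  # row softmax
scores = attention_by_token_type(
    attn, ["identifier", "keyword", "identifier", "operator"]
)
print(max(scores, key=scores.get))  # token type receiving the most attention
```

In practice, the per-layer attention tensors would come from the pre-trained model (e.g., via Hugging Face Transformers with `output_attentions=True`), and the token-type labels from a source-code tokenizer or parser.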