走向一个解释神经法模型的因果关系理论 (Toward a Theory of Causation for Interpreting Neural Code Models)

Neural Language Models of Code, or Neural Code Models (NCMs), are rapidly progressing from research prototypes to commercial developer tools. As such, understanding the capabilities and limitations of such models is becoming critical. However, the abilities of these models are typically measured using automated metrics that often only reveal a portion of their real-world performance. While, in general, the performance of NCMs appears promising, currently much is unknown about how such models arrive at decisions. To this end, this paper introduces $do_{code}$, a post-hoc interpretability methodology specific to NCMs that is capable of explaining model predictions. $do_{code}$ is based upon causal inference to enable programming language-oriented explanations. While the theoretical underpinnings of $do_{code}$ are extensible to exploring different model properties, we provide a concrete instantiation that aims to mitigate the impact of spurious correlations by grounding explanations of model behavior in properties of programming languages. To demonstrate the practical benefit of $do_{code}$, we illustrate the insights that our framework can provide by performing a case study on two popular deep learning architectures and nine NCMs. The results of this case study illustrate that our studied NCMs are sensitive to changes in code syntax and statistically learn to predict tokens related to blocks of code (e.g., brackets, parenthesis, semicolon) with less confounding bias as compared to other programming language constructs. These insights demonstrate the potential of $do_{code}$ as a useful model debugging mechanism that may aid in discovering biases and limitations in NCMs.

翻译：代码的神经语言模型或神经代码模型(NCM)正在从研究原型迅速进步到商业开发工具。因此,理解这些模型的能力和局限性正在变得至关重要。但是,这些模型的能力通常使用自动度量来衡量,往往只显示其真实世界性能的一部分。虽然一般来说,NCMS的表现看似有希望,但对于这些模型是如何形成决定的,目前还不清楚。为此,本文件引入了$do ⁇ code}美元,这是一个适合NCMS的、能够解释模型预测的后可理解性方法。 $do ⁇ code}美元基于因果推论,以便能够编程以语言为导向的解释解释。尽管美元代码的理论基础通常能够用来探索不同的模型属性。我们提供了具体的即时速,目的是通过用模型的特性解释模型行为来减轻这些模型行为的影响。为了证明$do ⁇ codecodecol$的实用效益,我们的框架可以通过进行案例研究来解释。 $dodo{codecode$(codeal)$) 美元, 美元基于因果关系推算性推算性推算, 也就是我们国家数据库的模型模型的模型的模型的模型和模型的模型的模型的模型的模型的模型的模型的模型的模型的模型的模型的模型的模型的变数, 和模型的变数, 解算法学变法学变的模型的解算法学的模型的变。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日