在大型软件系统中对树木内核对委托时间分流探测的效用进行经验性评价 (An empirical evaluation of the usefulness of Tree Kernels for Commit-time Defect Detection in large software systems)

Defect detection at commit check-in time prevents the introduction of defects into software systems. Current defect detection approaches rely on metric-based models which are not very accurate and whose results are not directly useful for developers. We propose a method to detect bug-inducing commits by comparing the incoming changes with all past commits in the project, considering both those that introduced defects and those that did not. Our method considers individual changes in the commit separately, at the method-level granularity. Doing so helps developers as they are informed of specific methods that need further attention instead of being told that the entire commit is problematic. Our approach represents source code as abstract syntax trees and uses tree kernels to estimate the similarity of the code with previous commits. We experiment with subtree kernels (STK), subset tree kernels (SSTK), or partial tree kernels (PTK). An incoming change is then classified using a K-NN classifier on the past changes. We evaluate our approach on the BigCloneBench benchmark and on the Technical Debt dataset, using the NiCad clone detector as the baseline. Our experiments with the BigCloneBench benchmark show that the tree kernel approach can detect clones with a comparable MAP to that of NiCad. Also, on defect detection with the Technical Debt dataset, tree kernels are least as effective as NiCad with MRR, F-score, and Accuracy of 0.87, 0.80, and 0.82 respectively.

翻译：在承诺检查时发现缺陷后,无法将缺陷引入软件系统。目前的缺陷检测方法依靠基于标准的模型,这些模型不十分准确,其结果对开发者没有直接用处。我们提出一种方法,通过将即将发生的变化与项目中过去的所有承诺进行比较来检测诱导错误的承诺,其中既考虑到引入缺陷的变化,也考虑到部分树内核的承诺。我们的方法在方法层面的颗粒度上分别考虑承诺的个别变化。这样做有助于开发者了解需要更多关注的具体方法,而不是被告知整个承诺有问题。我们的方法代表了源代码,作为抽象的合成树,并且使用树内核来估计代码与以往承诺的相似性。我们试验了子树内核(STK)、子树内核(SSTK)或部分树内核(PTK),然后在方法上使用K-NN分类器对承诺进行分类。我们评价了我们在BigCloneBennch基准和技术债务数据集方面的做法,利用NiC克隆最低的数值探测器来估计该代码与以前的承诺测试基准。我们用BIC的试验可以用来作为BC的基核检测基准。