Software defect prediction models are classifiers that are constructed from historical software data. Such software defect prediction models have been proposed to help developers optimize the limited Software Quality Assurance (SQA) resources and help managers develop SQA plans. Prior studies have different goals for their defect prediction models and use different techniques for generating visual explanations of their models. Yet, it is unclear what are the practitioners' perceptions of (1) these defect prediction model goals, and (2) the model-agnostic techniques used to visualize these models. We conducted a qualitative survey to investigate practitioners' perceptions of the goals of defect prediction models and the model-agnostic techniques used to generate visual explanations of defect prediction models. We found that (1) 82%-84% of the respondents perceived that the three goals of defect prediction models are useful; (2) LIME is the most preferred technique for understanding the most important characteristics that contributed to a prediction of a file, while ANOVA/VarImp is the second most preferred technique for understanding the characteristics that are associated with software defects in the past. Our findings highlight the significance of investigating how to improve the understanding of defect prediction models and their predictions. Hence, model-agnostic techniques from explainable AI domain may help practitioners to understand defect prediction models and their predictions.
翻译:软件缺陷预测模型是根据历史软件数据构建的分类模型。这些软件缺陷预测模型是为了帮助开发者优化有限的软件质量保证(SQA)资源和帮助管理人员开发SQA计划。先前的研究对缺陷预测模型有不同的目标,使用不同的技术对其模型进行直观解释。然而,从业者对(1)这些缺陷预测模型目标和(2)用以直观这些模型的模型-不可知性技术的看法尚不清楚。我们进行了一次定性调查,以调查从业者对缺陷预测模型目标和用于生成缺陷预测模型直观解释的模型-不可知性技术的看法。我们发现:(1)82%-84%的受访者认为缺陷预测模型的三个目标有用;(2) LIME是最可取的用来理解有助于预测文件的最重要特征的技术,而ANOVA/VarImp是了解与过去软件缺陷相关特征的第二最可取的技术。我们的调查结果突出表明了调查如何帮助理解缺陷预测模型及其预测模型的重要性。因此,模型预测和内部预测技术可以解释。