Software Vulnerability Prediction (SVP) is a data-driven technique for software quality assurance that has recently gained considerable attention in the Software Engineering research community. However, the difficulties of preparing Software Vulnerability (SV) related data is considered as the main barrier to industrial adoption of SVP approaches. Given the increasing, but dispersed, literature on this topic, it is needed and timely to systematically select, review, and synthesize the relevant peer-reviewed papers reporting the existing SV data preparation techniques and challenges. We have carried out a Systematic Literature Review (SLR) of SVP research in order to develop a systematized body of knowledge of the data preparation challenges, solutions, and the needed research. Our review of the 61 relevant papers has enabled us to develop a taxonomy of data preparation for SVP related challenges. We have analyzed the identified challenges and available solutions using the proposed taxonomy. Our analysis of the state of the art has enabled us identify the opportunities for future research. This review also provides a set of recommendations for researchers and practitioners of SVP approaches.
翻译:软件脆弱性预测(SVP)是软件工程研究界最近相当重视的一种以数据驱动的软件质量保证技术,然而,编制软件脆弱性相关数据的困难被认为是工业采用SVP方法的主要障碍。鉴于关于这一专题的文献越来越多,但很分散,因此需要及时系统地选择、审查并综合相关的同行审查文件,报告现有的SV数据编制技术和挑战。我们进行了SVP研究的系统文学审查,以形成关于数据编制挑战、解决办法和所需研究的系统化知识。我们对61份相关文件的审查使我们能够为SVP相关挑战制定数据编制分类。我们利用拟议的分类分析分析了已查明的挑战和现有解决办法。我们对技术现状的分析使我们能够确定未来研究的机会。这次审查还为SVP方法的研究人员和从业人员提供了一套建议。