Context: software projects are common resources in Software Engineering experiments, although these are often selected without following a specific strategy, which reduces the representativeness and replication of the results. An option is the use of preserved collections of software projects, but these must be current, with explicit guidelines that guarantee their updating over a long period of time. Goal: to carry out a systematic secondary study about the strategies to select software projects in empirical studies to discover the guidelines taken into account, the degree of use of project collections, the meta-data extracted and the subsequent statistical analysis conducted. Method: A systematic mapping study to identify studies published from January 2013 to December 2020. Results: 122 studies were identified, of which the 72% used their own guidelines for project selection and the 27% used existent project collections. Likewise, there was no evidence of a standardized framework for the project selection process, nor the application of statistical methods that relates with the sample collection strategy.
翻译:软件项目是软件工程实验的共同资源,尽管这些项目往往是在没有遵循具体战略的情况下选择的,从而降低了成果的代表性和复制力; 一种选择是使用保存的软件项目收藏,但必须及时进行,有明确的准则保证长期更新这些项目; 目标:就经验研究中选择软件项目的战略进行系统的二次研究,以发现考虑到的准则、项目收集的利用程度、提取的元数据以及随后进行的统计分析; 方法:系统绘图研究,以确定2013年1月至2020年12月公布的研究报告; 成果:查明了122项研究,其中72%的研究使用了自己的项目选择准则,27%使用了现有的项目收藏; 同样,没有证据表明项目选择过程的标准框架,也没有证据表明采用了与抽样收集战略有关的统计方法。