Context: The need of replicating empirical studies in Computer Science (CS) is widely recognized among the research community to consolidate acquired knowledge generalizing results. It is essential to report the changes of each replication to understand the evolution of the experimental validity across a family of studies. Unfortunately, the lack of proposals undermines these objectives. Objective. The main goal of our work is to provide researchers in CS, and in other areas of research, with a systematic, tool-supported approach for the reporting of changes in the replications of their empirical studies. Method: Applying DSR, we have developed and validated a composite artifact consisting of (i) a metamodel of the relevant concepts of replications and their changes; (ii) templates and linguistic patterns for reporting those concepts; and (iii) a proof-of-concept model-based software tool that supports the proposed approach. For its validation, we have carried out a multiple case study including 9 families of empirical studies from CS and Agrobiology. The 9 families encompass 23 replication studies and 92 replication changes, for which we have analyzed the suitability of our proposal. Results: The multiple case study revealed some initial limitations of our approach related to threats to experimental validity or context variables. After several improvement iterations, all the 92 replication changes could be properly specified, including also their qualitatively estimated effects on experimental validity across the family of experiments and its corresponding visualization. Conclusions: Our proposal for the specification of replication changes seems to fit the needs not only of replications in CS, but also in other research areas. Nevertheless, further research is needed to improve it and to disseminate its use among the research community.
翻译:研究界广泛认识到需要复制计算机科学的经验性研究,以巩固已获得的知识,普及成果;必须报告每次复制的变化,以了解整个研究系列实验有效性的演变情况;不幸的是,缺乏建议破坏了这些目标; 目标:我们工作的主要目标是向计算机科学和其他研究领域的研究人员提供计算机科学和其他研究领域的经验性研究的系统、工具支持的方法,以报告其经验性研究的复制情况。方法:应用DSR,我们开发并验证了一个综合工艺品,其中包括:(一) 复制及其变化相关概念的元模型;(二) 用于报告这些概念的模板和语言模式;以及(三) 支持拟议方法的理论性示范性模型性软件工具。为了证实这一点,我们开展了一项多项案例研究,包括9个CS和Agrobilog的实验性研究系列。 9个家庭仅包括23种复制性研究和92种复制性变化,为此我们分析了我们提案的适宜性。结果:多项案例研究表明,我们的方法在最初的复制性研究领域存在一些局限性,其质量性研究的准确性影响是其实验性研究的正确性,因此,其价值性研究的精确性研究的精确性研究范围似乎包括:其所有实验性研究的精确性研究的精确性研究领域,其价值上,其所有价值的精确性研究的精确性研究的精确性研究范围。