In empirical software engineering, benchmarks can be used for comparing different methods, techniques and tools. However, the recent ACM SIGSOFT Empirical Standards for Software Engineering Research do not include an explicit checklist for benchmarking. In this paper, we discuss benchmarks for software performance and scalability evaluation as example research areas in software engineering, relate benchmarks to some other empirical research methods, and discuss the requirements on benchmarks that may constitute the basis for a checklist of a benchmarking standard for empirical software engineering research.
翻译:在实证软件工程中,基准可用于比较不同方法、技术和工具,然而,最近的ACM SIGSOFT软件工程研究经验标准没有列入明确的基准清单,在本文件中,我们将软件性能和可扩缩性评价基准作为软件工程的示范研究领域加以讨论,将基准与其他一些经验研究方法联系起来,并讨论可作为实证软件工程研究基准标准清单基础的基准要求。