Background. Many mutation reduction strategies, which aim to reduce the number of mutants, have been proposed. Problem. It is important to measure how well a mutation reduction strategy preserves the evaluation of test suite effectiveness. However, existing evaluation indicators cannot measure this "order-preserving ability". Objective. We aim to propose evaluation indicators that measure the "order-preserving ability" of a mutation reduction strategy, a property that is important but that our community currently has no means of measuring. Method. Given a test suite on a Software Under Test (SUT) with a set of original mutants, we leverage the test suite to generate a group of test suites that have a partial order relationship in fault-detecting potential. When evaluating a reduction strategy, we first construct two partial order relationships among the generated test suites in terms of mutation score, one with the original mutants and another with the reduced mutants. Then, we measure the extent to which the two partial order relationships are consistent. The more consistent the two partial order relationships are, the stronger the Order Preservation (OP) of the mutation reduction strategy is, and the more effective the reduction strategy is. Furthermore, we propose Effort-aware Relative Order Preservation (EROP) to measure how much gain a mutation reduction strategy provides compared with a random reduction strategy. Result. The experimental results show that OP and EROP can efficiently measure the "order-preserving ability" of a mutation reduction strategy. As a result, they distinguish between mutation reduction strategies better than the existing evaluation indicators do. Conclusion. We suggest that researchers use OP and EROP to measure the effectiveness of a mutation reduction strategy.
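To make the Method step concrete, the following is a minimal Python sketch of comparing test-suite orderings under the original and the reduced mutants. It is an illustration under stated assumptions, not the paper's definition of OP or EROP: the abstract does not give the exact formulas, so the pairwise agreement ratio used here is an assumed stand-in for "consistency", and the names `mutation_score`, `order_agreement`, `kill_map`, and the toy data are hypothetical.

```python
from itertools import combinations


def mutation_score(suite, kill_map, mutants):
    """Fraction of `mutants` killed by at least one test in `suite`."""
    if not mutants:
        return 0.0
    killed = [m for m in mutants if kill_map[m] & suite]
    return len(killed) / len(mutants)


def _cmp(x, y):
    """Return -1, 0, or 1 depending on how x compares with y."""
    return (x > y) - (x < y)


def order_agreement(suites, kill_map, original, reduced):
    """Share of suite pairs that both mutant sets rank the same way.

    Assumption: this simple pairwise ratio stands in for the paper's
    consistency measure, which the abstract does not spell out.
    """
    agree = total = 0
    for a, b in combinations(suites, 2):
        by_original = _cmp(mutation_score(a, kill_map, original),
                           mutation_score(b, kill_map, original))
        by_reduced = _cmp(mutation_score(a, kill_map, reduced),
                          mutation_score(b, kill_map, reduced))
        total += 1
        agree += by_original == by_reduced
    return agree / total if total else 1.0


# Hypothetical toy data: which tests kill which mutant.
kill_map = {"m1": {"t1"}, "m2": {"t2"}, "m3": {"t2"}, "m4": {"t3"}}
original = set(kill_map)

# Nested suites, so each later suite detects at least as much as the earlier one.
suites = [frozenset({"t1"}),
          frozenset({"t1", "t2"}),
          frozenset({"t1", "t2", "t3"})]

keeps_order = order_agreement(suites, kill_map, original, {"m1", "m2", "m4"})
loses_order = order_agreement(suites, kill_map, original, {"m1", "m4"})
```

In this toy setup, the reduced set `{"m1", "m2", "m4"}` ranks all three suite pairs as the original mutants do, while `{"m1", "m4"}` can no longer separate the first two suites and so agrees on only two of the three pairs. A comparison against randomly chosen reduced sets of the same size, in the spirit of EROP, could reuse `order_agreement`, but its exact form is not specified in the abstract and is left out here.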