Context: This work is based on property-based testing (PBT). PBT is an increasingly important form of software testing. Furthermore, it serves as a concrete gateway into the abstract area of formal methods. Specifically, we focus on students learning PBT methods. Inquiry: How well do students do at PBT? Our goal is to assess the quality of the predicates they write as part of PBT. Prior work introduced the idea of decomposing the predicate's property into a conjunction of independent subproperties. Testing the predicate against each subproperty gives a "semantic" understanding of their performance. Approach: The notion of independence of subproperties both seems intuitive and was an important condition in prior work. First, we show that this condition is overly restrictive and might hide valuable information: it both undercounts errors and makes it hard to capture misconceptions. Second, we introduce two forms of automation, one based on PBT tools and the other on SAT-solving, to enable testing of student predicates. Third, we compare the output of these automated tools against manually-constructed tests. Fourth, we also measure the performance of those tools. Finally, we re-assess student performance reported in prior work. Knowledge: We show the difficulty caused by the independent subproperty requirement. We provide insight into how to use automation effectively to assess PBT predicates. In particular, we discuss the steps we had to take to beat human performance. We also provide insight into how to make the automation work efficiently. Finally, we present a much richer account than prior work of how students did. Grounding: Our methods are grounded in mathematical logic. We also make use of well-understood principles of test generation from more formal specifications. This combination ensures the soundness of our work. We use standard methods to measure performance. Importance: As both educators and programmers, we believe PBT is a valuable tool for students to learn, and its importance will only grow as more developers appreciate its value. Effective teaching requires a clear understanding of student knowledge and progress. Our methods enable a rich and automated analysis of student performance on PBT that yields insight into their understanding and can capture misconceptions. We therefore expect these results to be valuable to educators.
翻译:这项工作基于基于财产的测试( PBT ) 。 PBT 是一个越来越重要的软件测试形式。 此外, 它是一个进入正式方法的抽象领域的具体通道。 具体地说, 我们侧重于学生学习 PBT 方法。 调查: 学生在 PBT 中做得如何? 我们的目标是评估他们作为 PBT 的一部分所写的前提的质量。 先前的工作引入了将上游属性分解成独立子体的组合。 测试每个子体层的顶端性能让“ 精度” 了解它们的性能。 方法: 亚体型独立的概念似乎不直观, 也是先前工作中的一个重要条件。 首先, 我们显示这种条件过于限制性, 可能隐藏有价值的信息: 它既低估错误, 也难以捕捉错误。 其次, 我们引入两种形式的自动化, 仅以PBT 工具为基础, 和另一个以解析为工具, 以测试学生的底部。 第三, 我们把这些自动工具的输出到手动的直观性能, 也让我们的直观。 最后, 我们测量了我们是如何在之前的工作表现。 最后, 我们又测量了我们用了自己的工具 。