Peer grading systems make large courses more scalable, provide students with faster and more detailed feedback, and help students to learn by thinking critically about the work of others. A key obstacle to the broader adoption of peer grading systems is motivating students to provide accurate grades. The literature has explored many different approaches to incentivizing accurate grading (which we survey in detail), but the strongest incentive guarantees have been offered by mechanisms that compare peer grades to trusted TA grades with a fixed probability. In this work, we show that less TA work is required when these probabilities are allowed to depend on the grades that students report. We prove this result in a model with two possible grades, arbitrary numbers of agents, no requirement that students grade multiple assignments, arbitrary but homogeneous noisy observation of the ground truth which students can pay a fixed cost to observe, and the possibility of misreporting grades before or after observing this signal. We give necessary and sufficient conditions for our new mechanism's feasibility, prove its optimality under these assumptions, and characterize its improvement over the previous state of the art both analytically and empirically. Finally, we relax our homogeneity assumption, allowing each student and TA to observe the ground truth according to a different noise model.
翻译:同行评分制度使大型课程更加可扩展,为学生提供更快和更详细的反馈,帮助学生通过批判性地思考他人的工作来学习。更广泛地采用同行评分制度的一个关键障碍是激励学生提供准确的分数。文献探讨了激励准确分数的多种不同方法(我们对此进行了详细调查),但将同龄分数与信任的TA分数和固定概率进行比较的机制提供了最有力的激励保障。在这项工作中,我们表明,如果允许这些概率取决于学生报告的年级,就较少需要开展TA工作。我们证明,这导致了一种模式,有两种可能分数,任意的代理,没有要求学生分多职,任意但同质地对真理进行观察,也没有要求学生在观察信号之前或之后支付固定费用,对成绩进行误报的可能性。我们为我们的新机制的可行性提供了必要和充分的条件,证明了这些假设的优化性,并说明了它比以往的艺术水平都要改进。最后,我们放松了我们的同系性假设,允许每个学生和TA之间的噪音以不同的方式观察地面。