It is common to evaluate a set of items by soliciting people to rate them. For example, universities ask students to rate the teaching quality of their instructors, and conference organizers ask authors of submissions to evaluate the quality of the reviews. However, in these applications, students often give a higher rating to a course if they receive higher grades in a course, and authors often give a higher rating to the reviews if their papers are accepted to the conference. In this work, we call these external factors the "outcome" experienced by people, and consider the problem of mitigating these outcome-induced biases in the given ratings when some information about the outcome is available. We formulate the information about the outcome as a known partial ordering on the bias. We propose a debiasing method by solving a regularized optimization problem under this ordering constraint, and also provide a carefully designed cross-validation method that adaptively chooses the appropriate amount of regularization. We provide theoretical guarantees on the performance of our algorithm, as well as experimental evaluations.
翻译:例如,大学要求学生评定教师的教学质量,会议组织者则要求提交材料的作者评估审查的质量。然而,在这些应用中,学生如果在课程中获得较高等级,通常对课程给予更高的评分,作者通常对审查给予较高的评分,如果他们的论文被会议接受,则对审查给予较高的评分。在这项工作中,我们将这些外部因素称为人们所经历的“结果”,并在获得关于结果的一些信息时考虑在给定评分中减少这些结果引起的偏见的问题。我们将有关结果的信息编成已知的关于偏见的部分定购。我们建议通过在这种定序限制下解决正规化的优化问题来消除偏见的方法,并且提供精心设计的交叉校准方法,以适应性地选择适当的规范数额。我们为算法的绩效以及实验性评估提供理论保证。