In peer review systems, reviewers are often asked to evaluate various features of submissions, such as technical quality or novelty. A score is given to each of the predefined features and based on these the reviewer has to provide an overall quantitative recommendation. However, reviewers differ in how much they value different features. It may be assumed that each reviewer has her own mapping from a set of criteria scores (score vectors) to a recommendation, and that different reviewers have different mappings in mind. Recently, Noothigattu, Shah and Procaccia introduced a novel framework for obtaining an aggregated mapping by means of Empirical Risk Minimization based on $L(p,q)$ loss functions, and studied its axiomatic properties in the sense of social choice theory. We provide a body of new results about this framework. On the one hand we study a trade-off between strategy-proofness and the ability of the method to properly capture agreements of the majority of reviewers. On the other hand, we show that dropping a certain unrealistic assumption makes the previously reported results to be no longer valid. Moreover, in the general case, strategy-proofness fails dramatically in the sense that a reviewer is able to make significant changes to the solution in her favor by arbitrarily small changes to their true beliefs. In particular, no approximate version of strategy-proofness is possible in this general setting since the method is not even continuous w.r.t. the data. Finally we propose a modified aggregation algorithm which is continuous and show that it has good axiomatic properties.
翻译:在同侪审查体系中,经常要求审查者评估提交材料的不同特点,例如技术质量或新颖性;对每个预先界定的特性给予一个分数,审查者必须根据这些特点提供总体定量建议;然而,审查者对不同特性的价值有不同之处;可以假定,每个审查者有自己的地图,从一套标准分数(核心矢量)到一项建议,不同审查者有不同的图象;最近,Noothigattu、Shah和Procaccia引入了一个新框架,以便通过根据美元(p,q)损失功能实现实证风险最小化的综合绘图,并在此基础上研究其不言明性特性;然而,从社会选择理论的意义上说,我们提供了一系列关于这一框架的新结果。一方面,我们研究的是战略的准确性和正确性方法在正确获取多数审查协议的能力之间的取舍。另一方面,我们发现,放弃某种不切实际的假设使得先前报告的结果不再有效。此外,在一般情况下,战略的准确性、准确性、最终的判断性能显示一个显著的准确性、最终判断是确定一个可能的方法。