Many decision-making processes involve evaluating and then selecting items; examples include scientific peer review, job hiring, school admissions, and investment decisions. The eventual selection is performed by applying rules or deliberations to the raw evaluations, and then deterministically selecting the items deemed to be the best. These domains feature error-prone evaluations and uncertainty about future outcomes, which undermine the reliability of such deterministic selection rules. As a result, selection mechanisms involving explicit randomization that incorporate the uncertainty are gaining traction in practice. However, current randomization approaches are ad hoc, and as we prove, inappropriate for their purported objectives. In this paper, we propose a principled framework for randomized decision-making based on interval estimates of the quality of each item. We introduce MERIT (Maximin Efficient Randomized Interval Top-k), an optimization-based method that maximizes the worst-case expected number of top candidates selected, under uncertainty represented by overlapping intervals (e.g., confidence intervals or min-max intervals). MERIT provides an optimal resource allocation scheme under an interpretable notion of robustness. We develop a polynomial-time algorithm to solve the optimization problem and demonstrate empirically that the method scales to over 10,000 items. We prove that MERIT satisfies desirable axiomatic properties not guaranteed by existing approaches. Finally, we empirically compare algorithms on synthetic peer review data. Our experiments demonstrate that MERIT matches the performance of existing algorithms in expected utility under fully probabilistic review data models used in previous work, while outperforming previous methods with respect to our novel worst-case formulation.
翻译:许多决策过程涉及对项目进行评估与选择,例如科学同行评审、人才招聘、学校招生和投资决策。最终选择通常通过对原始评估结果应用规则或审议,然后确定性地选取被认为最优的项目。这些领域普遍存在易出错的评估和对未来结果的不确定性,这削弱了确定性选择规则的可靠性。因此,在决策中显式纳入随机化以反映不确定性的选择机制在实践中日益受到关注。然而,现有的随机化方法多为临时性方案,且如我们证明的,往往无法实现其宣称的目标。本文提出一种基于项目质量区间估计的原理性随机决策框架。我们引入MERIT(基于区间估计的最大最小效率随机化Top-k选择方法),这是一种基于优化的方法,在由重叠区间(如置信区间或最小-最大区间)表示的不确定性下,最大化所选顶尖候选者的最坏情况期望数量。MERIT在可解释的鲁棒性概念下提供了最优的资源分配方案。我们开发了多项式时间算法求解该优化问题,并通过实验证明该方法可扩展至超过10,000个项目。我们证明了MERIT满足现有方法无法保证的若干理想公理性质。最后,我们在合成同行评审数据上对算法进行了实证比较。实验表明,在先前工作中使用的完全概率化评审数据模型下,MERIT在期望效用方面与现有算法性能相当,同时在我们提出的新颖最坏情况优化目标上优于先前方法。