Many selection processes such as finding patients qualifying for a medical trial or retrieval pipelines in search engines consist of multiple stages, where an initial screening stage focuses the resources on shortlisting the most promising candidates. In this paper, we investigate what guarantees a screening classifier can provide, independently of whether it is constructed manually or trained. We find that current solutions do not enjoy distribution-free theoretical guarantees -- we show that, in general, even for a perfectly calibrated classifier, there always exist specific pools of candidates for which its shortlist is suboptimal. Then, we develop a distribution-free screening algorithm -- called Calibrated Subset Selection (CSS) -- that, given any classifier and some amount of calibration data, finds near-optimal shortlists of candidates that contain a desired number of qualified candidates in expectation. Moreover, we show that a variant of CSS that calibrates a given classifier multiple times across specific groups can create shortlists with provable diversity guarantees. Experiments on US Census survey data validate our theoretical results and show that the shortlists provided by our algorithm are superior to those provided by several competitive baselines.
翻译:许多选择过程,例如找到有资格在搜索引擎中接受医疗试验或检索管道的病人,这些选择过程包括多个阶段,初步筛选阶段将资源集中到最有希望的候选人的短名单中。在本文中,我们调查一个筛选分类师可以提供哪些保障,而不管它是人工还是经过培训的。我们发现,目前的解决方案并不享有无分配的理论保障 -- -- 我们发现,一般来说,即使是一个完全校准的分类师,也总是有其短名单不最优的候选者。然后,我们开发了一个不分发的筛选算法 -- -- 称为校准子选择(CSS) -- -- 考虑到任何分类员和某些数量的校准数据,我们发现几乎最理想的候选人短名单,其中含有期望的合格候选人人数。此外,我们显示,一个对特定分类师进行多次校准、跨特定群体进行校准的变式,可以产生具有可辨定多样性保证的短名单。关于人口普查数据的实验证实了我们的理论结果,并表明,我们的算法提供的短名单优于几个竞争性基线提供的候选人。