Many selection processes such as finding patients qualifying for a medical trial or retrieval pipelines in search engines consist of multiple stages, where an initial screening stage focuses the resources on shortlisting the most promising candidates. In this paper, we investigate what guarantees a screening classifier can provide, independently of whether it is constructed manually or trained. We find that current solutions do not enjoy distribution-free theoretical guarantees -- we show that, in general, even for a perfectly calibrated classifier, there always exist specific pools of candidates for which its shortlist is suboptimal. Then, we develop a distribution-free screening algorithm -- called Calibrated Subset Selection (CSS) -- that, given any classifier and some amount of calibration data, finds near-optimal shortlists of candidates that contain a desired number of qualified candidates in expectation. Moreover, we show that a variant of our algorithm that calibrates a given classifier multiple times across specific groups can create shortlists with provable diversity guarantees. Experiments on US Census survey data validate our theoretical results and show that the shortlists provided by our algorithm are superior to those provided by several competitive baselines.
翻译:许多选择过程,例如找到有资格在搜索引擎中接受医疗试验或检索管道的病人,这些选择过程包括多个阶段,初步筛选阶段将资源集中到最有希望的候选人名单上。在本文中,我们调查一个筛选分类师可以提供哪些保障,而不管它是人工还是经过培训的。我们发现,目前的解决方案并不享受免费分配的理论保障 -- -- 我们发现,一般来说,即使是一个完全校准的分类师,也总是有其短名单不最优化的特定候选人群体。然后,我们开发了一个免费分配的筛选算法 -- -- 称为校准的子集选择(CSS) -- -- 考虑到任何分类员和某些数量的校准数据,我们发现近于最佳的候选人短名单,其中含有期望的合格候选人人数。此外,我们展示了一种对特定分类师进行多次校准的算法的变式,这种算法可以在特定群体中创建短名单,其短名单中的短名单将保证多样性。关于人口普查的实验数据验证了我们的理论结果,并显示,我们的算法提供的短名单优于几个竞争性基线提供的名单。