Finding a representative cohort from a broad pool of candidates is a goal that arises in many contexts such as choosing governing committees and consumer panels. While there are many ways to define the degree to which a cohort represents a population, a very appealing solution concept is lexicographic maximality (leximax) which offers a natural (pareto-optimal like) interpretation that the utility of no population can be increased without decreasing the utility of a population that is already worse off. However, finding a leximax solution can be highly dependent on small variations in the utility of certain groups. In this work, we explore new notions of approximate leximax solutions with three distinct motivations: better algorithmic efficiency, exploiting significant utility improvements, and robustness to noise. Among other definitional contributions, we give a new notion of an approximate leximax that satisfies a similarly appealing semantic interpretation and relate it to algorithmically-feasible approximate leximax notions. When group utilities are linear over cohort candidates, we give an efficient polynomial-time algorithm for finding a leximax distribution over cohort candidates in the exact as well as in the approximate setting. Furthermore, we show that finding an integer solution to leximax cohort selection with linear utilities is NP-Hard.
翻译:在选择管理委员会和消费者小组等许多情况下,从广泛的候选人群体中寻找具有代表性的一组人是一个目标。虽然有许多方法可以界定一个群体代表人口的程度,但一个非常有吸引力的解决办法概念是地名录最大度(Leximax),它提供了一种自然的(最优化的)解释,即不能增加任何人口的功用,而不会降低本已更差的人口的功用。然而,找到一个Leximax解决方案可能高度取决于某些群体效用的微小变化。在这项工作中,我们探索了近似地法理学解决方案的新概念,其三个不同动机是:提高算法效率,利用显著的效用改进,以及对噪音的稳健性。在其它定义性贡献中,我们给出了一种近似地(Leximax)概念,它能够满足类似的具有吸引力的语义解释,并将它与逻辑上可行的约值近似近似近于Leximax概念的概念联系起来。当集团公用事业是线性比组候选人时,我们给出一种有效的多时算算法时间算法,用以找到准确和近似情况下组合候选人的分类分布。此外,我们用的是公司集团选择与直线性公用事业的方法选择。