We formulate, analyze and solve the problem of best arm identification with fairness constraints on subpopulations (BAICS). Standard best arm identification problems aim at selecting an arm that has the largest expected reward where the expectation is taken over the entire population. The BAICS problem requires that an selected arm must be fair to all subpopulations (e.g., different ethnic groups, age groups, or customer types) by satisfying constraints that the expected reward conditional on every subpopulation needs to be larger than some thresholds. The BAICS problem aims at correctly identify, with high confidence, the arm with the largest expected reward from all arms that satisfy subpopulation constraints. We analyze the complexity of the BAICS problem by proving a best achievable lower bound on the sample complexity with closed-form representation. We then design an algorithm and prove that the algorithm's sample complexity matches with the lower bound in terms of order. A brief account of numerical experiments are conducted to illustrate the theoretical findings.
翻译:本文研究了一个基于子人群公平约束的最佳手臂识别问题(BAICS)。标准最佳手臂识别问题旨在选择一个期望回报最大的手臂,其中期望是在整个人群上取的。BAICS问题要求所选择的手臂必须对所有子人群(如不同的种族、年龄组或客户类型)都公平,通过满足期望回报条件限制。BAICS问题的目标是正确地识别出在满足子人群约束条件下预期回报最大的所有手臂中的最佳手臂,且具有高置信度。我们分析了BAICS问题的复杂性,通过证明一个具有闭合形式的样本复杂度最优下限来完成分析。随后,我们设计了一种算法,并证明该算法的样本复杂度与下限的阶相匹配。最后,通过简要的数值实验来展示理论结果。