Algorithmic decision-making in societal contexts, such as retail pricing, loan administration, recommendations on online platforms, etc., often involves experimentation with decisions for the sake of learning, which results in perceptions of unfairness among people impacted by these decisions. It is hence necessary to embed appropriate notions of fairness in such decision-making processes. The goal of this paper is to highlight the rich interface between temporal notions of fairness and online decision-making through a novel meta-objective of ensuring fairness at the time of decision. Given some arbitrary comparative fairness notion for static decision-making (e.g., students should pay at most 90% of the general adult price), a corresponding online decision-making algorithm satisfies fairness at the time of decision if the said notion of fairness is satisfied for any entity receiving a decision in comparison to all the past decisions. We show that this basic requirement introduces new methodological challenges in online decision-making. We illustrate the novel approaches necessary to address these challenges in the context of stochastic convex optimization with bandit feedback under a comparative fairness constraint that imposes lower bounds on the decisions received by entities depending on the decisions received by everyone in the past. The paper showcases novel research opportunities in online decision-making stemming from temporal fairness concerns.
翻译:在社会背景下,如零售定价、贷款管理、在线平台上的建议等,对决策进行实验,以学习为目的,往往涉及对决策进行实验,从而导致受这些决策影响的人对不公平的看法,因此有必要将适当的公平概念纳入此类决策进程。本文件的目的是通过确保决策时公平性的新颖的元目标,突出公平性概念与在线决策之间的丰富联系。鉴于静态决策的任意比较公平概念(例如,学生应支付成人总价格的90%),相应的在线决策算法在决策时满足公平性,如果与以往所有决定相比,任何接受决策的实体都满足上述公平概念。我们表明,这一基本要求在网上决策中引入了新的方法挑战。我们介绍了在以强势组合优化和强势反馈的背景下应对这些挑战所需的新办法,这种新办法对实体收到的决定规定了较低的约束,取决于过去每个人收到的决定。文件展示了在线决策的公平性机会。