It has recently been shown that if feedback effects of decisions are ignored, then imposing fairness constraints such as demographic parity or equality of opportunity can actually exacerbate unfairness. We propose to address this challenge by modeling feedback effects as Markov decision processes (MDPs). First, we propose analogs of fairness properties for the MDP setting. Second, we propose algorithms for learning fair decision-making policies for MDPs. Finally, we demonstrate the need to account for dynamical effects using simulations on a loan applicant MDP.
翻译:最近已经表明,如果决定的反馈效应被忽视,那么实行人口均等或机会平等等公平限制实际上会加剧不公平现象。我们提议通过将反馈效应建模作为Markov决策过程(MDPs)来应对这一挑战。首先,我们为MDP环境提出了公平属性的类比。第二,我们提出了为MDP学习公平决策政策的算法。最后,我们证明有必要使用模拟对贷款申请人MDP进行模拟来说明动态效应。