When subjected to automated decision-making, decision subjects may strategically modify their observable features in ways they believe will maximize their chances of receiving a favorable decision. In many practical situations, the underlying assessment rule is deliberately kept secret to avoid gaming and maintain competitive advantage. The resulting opacity forces the decision subjects to rely on incomplete information when making strategic feature modifications. We capture such settings as a game of Bayesian persuasion, in which the decision maker offers a form of recourse to the decision subject by providing them with an action recommendation (or signal) to incentivize them to modify their features in desirable ways. We show that when using persuasion, both the decision maker and decision subject are never worse off in expectation, while the decision maker can be significantly better off. While the decision maker's problem of finding the optimal Bayesian incentive-compatible (BIC) signaling policy takes the form of optimization over infinitely-many variables, we show that this optimization can be cast as a linear program over finitely-many regions of the space of possible assessment rules. While this reformulation simplifies the problem dramatically, solving the linear program requires reasoning about exponentially-many variables, even under relatively simple settings. Motivated by this observation, we provide a polynomial-time approximation scheme that recovers a near-optimal signaling policy. Finally, our numerical simulations on semi-synthetic data empirically illustrate the benefits of using persuasion in the algorithmic recourse setting.
翻译:当进行自动化决策时,决策主体可能会从战略上改变其可观察特征,其方式是他们认为最大限度地增加获得有利决定的机会。在许多实际情况下,基本评估规则被故意保密,以避免赌博和保持竞争优势。由此产生的不透明迫使决策主体在进行战略特征修改时依赖不完整信息。我们捕捉了诸如贝叶西亚说服游戏这样的环境,即决策者提供了行动建议(或信号),以激励他们以理想的方式修改其可观察特征。我们表明,在使用说服时,决策者和决定主题都永远不会比预期更糟糕,而决策者则会做得更好。虽然决策者在寻找最佳巴伊西亚激励兼容性(BBIC)信号政策时遇到的难题,其形式是对无限多变变量的优化,我们显示,这种优化可以被描绘成一个线性程序,而不是以有限的多种方式修改可能的评估规则的空间。我们这种重新制定的方法,甚至大大地简化了问题,而解决接近直线性方案时,需要用一个相对的指数性模型推理,我们最后要用一种指数推算的模型来恢复我们的数据。