As predictive models are deployed into the real world, they must increasingly contend with strategic behavior. A growing body of work on strategic classification treats this problem as a Stackelberg game: the decision-maker "leads" in the game by deploying a model, and the strategic agents "follow" by playing their best response to the deployed model. Importantly, in this framing, the burden of learning is placed solely on the decision-maker, while the agents' best responses are implicitly treated as instantaneous. In this work, we argue that the order of play in strategic classification is fundamentally determined by the relative frequencies at which the decision-maker and the agents adapt to each other's actions. In particular, by generalizing the standard model to allow both players to learn over time, we show that a decision-maker that makes updates faster than the agents can reverse the order of play, meaning that the agents lead and the decision-maker follows. We observe in standard learning settings that such a role reversal can be desirable for both the decision-maker and the strategic agents. Finally, we show that a decision-maker with the freedom to choose their update frequency can induce learning dynamics that converge to Stackelberg equilibria with either order of play.
翻译:随着预测模型被部署到现实世界,它们必须越来越多地与战略行为竞争。越来越多的战略分类工作把这一问题当作斯大克尔贝格游戏来对待:在游戏中,决策者“带头”通过部署模型,而战略代理人则通过对部署模型发挥最佳反应来“追随”。重要的是,在这一框架中,学习负担完全由决策者承担,而代理人的最佳反应则被暗含地视为瞬间。在这项工作中,我们争辩说,战略分类的顺序基本上取决于决策者和代理人适应对方行动的相对频率。特别是,通过普及标准模型,让双方行为者能够长期学习,我们表明,一个比代理人更快更新的决策者能够扭转游戏顺序,这意味着代理人领导和决策者可以跟随。我们在标准学习环境中看到,决策者和战略代理人最好能够以瞬间的方式进行这种作用的逆转。最后,我们表明,一个选择其更新频率的决策者可以引导学习动态,这种动态与 Stakkkelberg equiribrial 和 Stackelbribrial 的顺序交汇。