As predictive models are increasingly being deployed to make a variety of consequential decisions, there is a growing emphasis on designing algorithms that can provide recourse to affected individuals. Existing recourse algorithms function under the assumption that the underlying predictive model does not change. However, models are regularly updated in practice for several reasons including data distribution shifts. In this work, we make the first attempt at understanding how model updates resulting from data distribution shifts impact the algorithmic recourses generated by state-of-the-art algorithms. We carry out a rigorous theoretical and empirical analysis to address the above question. Our theoretical results establish a lower bound on the probability of recourse invalidation due to model shifts, and show the existence of a tradeoff between this invalidation probability and typical notions of "cost" minimized by modern recourse generation algorithms. We experiment with multiple synthetic and real world datasets, capturing different kinds of distribution shifts including temporal shifts, geospatial shifts, and shifts due to data correction. These experiments demonstrate that model updation due to all the aforementioned distribution shifts can potentially invalidate recourses generated by state-of-the-art algorithms. Our findings thus not only expose previously unknown flaws in the current recourse generation paradigm, but also pave the way for fundamentally rethinking the design and development of recourse generation algorithms.
翻译:随着预测模型的日益部署以作出各种随之而来的决定,人们越来越强调设计能够向受影响个人提供求助的算法,而现有的追索算法则在基本预测模型没有改变的假设下发挥作用;然而,由于数据分布的变化等若干原因,这些模型在实践中经常更新。在这项工作中,我们第一次尝试了解数据分配所产生的模型更新如何影响最先进的算法所产生的算法。我们进行了严格的理论和实验性分析,以解决上述问题。我们的理论结果对因模型变换而导致追索无效的可能性设定了一个较低的界限,并显示了这种失效概率与现代追索生成算法将“成本”的典型概念之间的权衡。我们实验了多种合成和真实世界数据集,捕捉了不同的分配变化,包括时间变换、地理空间变迁和数据校正。这些实验表明,由于所有上述分配变迁而产生的模型升级有可能使由最新算法产生的追索方法无效。因此,我们的调查结果不仅暴露了现代追索生成算法模式的先前未知的缺陷,而且还铺平了当代追索算法的发展模式。