野生的算法利用:了解数据和模型转换的影响 (Algorithmic Recourse in the Wild: Understanding the Impact of Data and Model Shifts)

As predictive models are increasingly being deployed to make a variety of consequential decisions, there is a growing emphasis on designing algorithms that can provide recourse to affected individuals. Existing recourse algorithms function under the assumption that the underlying predictive model does not change. However, models are regularly updated in practice for several reasons including data distribution shifts. In this work, we make the first attempt at understanding how model updates resulting from data distribution shifts impact the algorithmic recourses generated by state-of-the-art algorithms. We carry out a rigorous theoretical and empirical analysis to address the above question. Our theoretical results establish a lower bound on the probability of recourse invalidation due to model shifts, and show the existence of a tradeoff between this invalidation probability and typical notions of "cost" minimized by modern recourse generation algorithms. We experiment with multiple synthetic and real world datasets, capturing different kinds of distribution shifts including temporal shifts, geospatial shifts, and shifts due to data correction. These experiments demonstrate that model updation due to all the aforementioned distribution shifts can potentially invalidate recourses generated by state-of-the-art algorithms. Our findings thus not only expose previously unknown flaws in the current recourse generation paradigm, but also pave the way for fundamentally rethinking the design and development of recourse generation algorithms.

翻译：随着预测模型的日益部署以作出各种随之而来的决定,人们越来越强调设计能够向受影响个人提供求助的算法,而现有的追索算法则在基本预测模型没有改变的假设下发挥作用;然而,由于数据分布的变化等若干原因,这些模型在实践中经常更新。在这项工作中,我们第一次尝试了解数据分配所产生的模型更新如何影响最先进的算法所产生的算法。我们进行了严格的理论和实验性分析,以解决上述问题。我们的理论结果对因模型变换而导致追索无效的可能性设定了一个较低的界限,并显示了这种失效概率与现代追索生成算法将“成本”的典型概念之间的权衡。我们实验了多种合成和真实世界数据集,捕捉了不同的分配变化,包括时间变换、地理空间变迁和数据校正。这些实验表明,由于所有上述分配变迁而产生的模型升级有可能使由最新算法产生的追索方法无效。因此,我们的调查结果不仅暴露了现代追索生成算法模式的先前未知的缺陷,而且还铺平了当代追索算法的发展模式。

相关内容

MoDELS

关注 43

ACM/IEEE第23届模型驱动工程语言和系统国际会议，是模型驱动软件和系统工程的首要会议系列，由ACM-SIGSOFT和IEEE-TCSE支持组织。自1998年以来，模型涵盖了建模的各个方面，从语言和方法到工具和应用程序。模特的参加者来自不同的背景，包括研究人员、学者、工程师和工业专业人士。MODELS 2019是一个论坛，参与者可以围绕建模和模型驱动的软件和系统交流前沿研究成果和创新实践经验。今年的版本将为建模社区提供进一步推进建模基础的机会，并在网络物理系统、嵌入式系统、社会技术系统、云计算、大数据、机器学习、安全、开源等新兴领域提出建模的创新应用以及可持续性。官网链接：http://www.modelsconference.org/