Human decision making is well known to be imperfect and the ability to analyse such processes individually is crucial when attempting to aid or improve a decision-maker's ability to perform a task, e.g. to alert them to potential biases or oversights on their part. To do so, it is necessary to develop interpretable representations of how agents make decisions and how this process changes over time as the agent learns online in reaction to the accrued experience. To then understand the decision-making processes underlying a set of observed trajectories, we cast the policy inference problem as the inverse to this online learning problem. By interpreting actions within a potential outcomes framework, we introduce a meaningful mapping based on agents choosing an action they believe to have the greatest treatment effect. We introduce a practical algorithm for retrospectively estimating such perceived effects, alongside the process through which agents update them, using a novel architecture built upon an expressive family of deep state-space models. Through application to the analysis of UNOS organ donation acceptance decisions, we demonstrate that our approach can bring valuable insights into the factors that govern decision processes and how they change over time.
翻译:众所周知,人类决策不完善,在试图帮助或提高决策者执行任务的能力时,单独分析这些过程的能力至关重要,例如提醒他们注意他们可能存在的偏见或监督。为此,必须制定可解释的表述方法,说明代理人如何作出决定,以及随着时间推移,随着代理人根据累积的经验在网上学习,这一过程如何变化。然后,为了理解一组观察到的轨道背后的决策过程,我们把政策推论问题作为这一在线学习问题的反面。通过在潜在结果框架内解释行动,我们根据选择他们认为具有最大治疗效果的行动的代理人,进行有意义的绘图。我们采用一种实用算法,追溯估计这种预期效果,同时利用代理人更新这些作用的过程,利用一个建立在深层国家-空间模型表达式组合上的新结构。我们通过应用对UNOS器官捐赠接受决定的分析,表明我们的方法能够对决定过程的决定因素及其随时间的变化带来宝贵的洞察力。