Blame attribution is one of the key aspects of accountable decision making, as it provides means to quantify the responsibility of an agent for a decision making outcome. In this paper, we study blame attribution in the context of cooperative multi-agent sequential decision making. As a particular setting of interest, we focus on cooperative decision making formalized by Multi-Agent Markov Decision Processes (MMDP), and we analyze different blame attribution methods derived from or inspired by existing concepts in cooperative game theory. We formalize desirable properties of blame attribution in the setting of interest, and we analyze the relationship between these properties and the studied blame attribution methods. Interestingly, we show that some of the well known blame attribution methods, such as Shapley value, are not performance-incentivizing, while others, such as Banzhaf index, may over-blame agents. To mitigate these value misalignment and fairness issues, we introduce a novel blame attribution method, unique in the set of properties it satisfies, which trade-offs explanatory power (by under-blaming agents) for the aforementioned properties. We further show how to account for uncertainty about agents' decision making policies, and we experimentally: a) validate the qualitative properties of the studied blame attribution methods, and b) analyze their robustness to uncertainty.
翻译:责任归属是负责任的决策的关键方面之一,因为它提供了量化一个代理人对决策结果的责任的手段。在本文中,我们研究了在合作性多代理人顺序决策背景下的责任归属问题。作为一个特殊的利益背景,我们侧重于由多代理人马尔科夫决策程序(MDP)正式确定的合作决策,我们分析合作性游戏理论中现有概念产生或启发的不同责任归属方法。我们在利益设定中将责任归属的适当属性正式化,我们分析这些属性与所研究的归责方法之间的关系。有趣的是,我们表明一些众所周知的归责方法,如Shapley价值,不是鼓励业绩归属方法,而其他一些方法,如Banzhaf指数,可能过度指责。为了减轻这些价值的不匹配和公平问题,我们采用了一种全新的归责归属方法,这是它所满足的一套特性中的独特,即交易性解释权(被指责的代理人)对于上述属性具有独特性。我们进一步说明如何解释代理人决策的不确定性,我们实验性地分析了其归属性。我们研究了质量属性。