Blame attribution is one of the key aspects of accountable decision making, as it provides means to quantify the responsibility of an agent for a decision making outcome. In this paper, we study blame attribution in the context of cooperative multi-agent sequential decision making. As a particular setting of interest, we focus on cooperative decision making formalized by Multi-Agent Markov Decision Processes (MMDPs), and we analyze different blame attribution methods derived from or inspired by existing concepts in cooperative game theory. We formalize desirable properties of blame attribution in the setting of interest, and we analyze the relationship between these properties and the studied blame attribution methods. Interestingly, we show that some of the well known blame attribution methods, such as Shapley value, are not performance-incentivizing, while others, such as Banzhaf index, may over-blame agents. To mitigate these value misalignment and fairness issues, we introduce a novel blame attribution method, unique in the set of properties it satisfies, which trade-offs explanatory power (by under-blaming agents) for the aforementioned properties. We further show how to account for uncertainty about agents' decision making policies, and we experimentally: a) validate the qualitative properties of the studied blame attribution methods, and b) analyze their robustness to uncertainty.
翻译:责任归属是问责决策的关键方面之一,因为它提供了量化决策结果代理人责任的手段。在本文中,我们研究了在合作性多代理人顺序决策背景下的责怪归属问题。作为一个特别的兴趣背景,我们侧重于由多代理Markov决策程序(MDPs)正式确定的合作决策,我们分析了合作性游戏理论中现有概念产生或启发的不同责怪归属方法。我们在利益设定中将责任归属的适当属性正式化,我们分析了这些属性与所研究的责怪归属方法之间的关系。有趣的是,我们表明,一些众所周知的责怪归属方法,如Shaply价值,不是鼓励业绩归属方法,而其他方法,如Banzhaf指数,可能过度指责因素。为了减轻这些价值的不匹配和公平问题,我们采用了一种新的责怪责归属方法,这是它所满足的一套特性的独特,即交易性解释权(被指责的代理人)对于上述属性具有独特性。我们进一步展示了对代理人决策不确定性进行核算的方法,例如Shale价值,我们研究其可靠性。