We investigate the application of the Shapley value to quantifying the contribution of a tuple to a query answer. The Shapley value is a widely known numerical measure in cooperative game theory and in many applications of game theory for assessing the contribution of a player to a coalition game. It has been established already in the 1950s, and is theoretically justified by being the very single wealth-distribution measure that satisfies some natural axioms. While this value has been investigated in several areas, it received little attention in data management. We study this measure in the context of conjunctive and aggregate queries by defining corresponding coalition games. We provide algorithmic and complexity-theoretic results on the computation of Shapley-based contributions to query answers; and for the hard cases we present approximation algorithms.
翻译:我们调查了Shapley值用于量化一个图例对查询答案的贡献。 Shapley值是合作游戏理论和许多应用游戏理论来评估玩家对联盟游戏的贡献的众所周知的数字尺度。 它早在1950年代就已经建立起来,理论上是合理的,因为它是满足某些自然法理的单一财富分配尺度。 虽然在几个领域调查过这一数值,但在数据管理方面却很少受到注意。 我们通过界定相应的联盟游戏来结合和汇总查询来研究这一尺度。 我们在计算基于Shapley的缴款以查询答案时提供了算法和复杂理论结果; 对于我们提出的近似算法的难题,我们提供了算法和复杂理论结果。