Submodular functions have been a powerful mathematical model for a wide range of real-world applications. Recently, submodular functions are becoming increasingly important in machine learning (ML) for modelling notions such as information and redundancy among entities such as data and features. Among these applications, a key question is payoff allocation, i.e., how to evaluate the importance of each entity towards the collective objective? To this end, classic solution concepts from cooperative game theory offer principled approaches to payoff allocation. However, despite the extensive body of game-theoretic literature, payoff allocation in submodular games are relatively under-researched. In particular, an important notion that arises in the emerging submodular applications is redundancy, which may occur from various sources such as abundant data or malicious manipulations where a player replicates its resource and act under multiple identities. Though many game-theoretic solution concepts can be directly used in submodular games, naively applying them for payoff allocation in these settings may incur robustness issues against replication. In this paper, we systematically study the replication manipulation in submodular games and investigate replication robustness, a metric that quantitatively measures the robustness of solution concepts against replication. Using this metric, we present conditions which theoretically characterise the robustness of semivalues, a wide family of solution concepts including the Shapley and Banzhaf value. Moreover, we empirically validate our theoretical results on an emerging submodular ML application, i.e., the ML data market.
 翻译:子模块功能是一系列现实世界应用的强大数学模型。 最近,亚模块功能在诸如数据和特征等实体的信息和冗余等建模概念的机器学习(ML)中越来越重要。在这些应用中,关键问题是如何分配报酬,即如何评价每个实体对集体目标的重要性?为此,合作游戏理论的经典解决方案概念提供了支付分配的原则性方法。然而,尽管有大量游戏理论文献,但亚模块游戏的支付分配相对研究不足。特别是,在新兴亚模块应用中产生的一个重要概念是冗余,可能来自诸如大量数据或恶意操纵等各种来源,即一个玩家复制其资源并在多重身份下采取行动。尽管许多游戏理论性解决方案概念可以直接用于亚模式游戏,但将之用于这些环境下的支付分配可能引发强大的问题。在本文中,我们系统地研究在亚模块游戏中的复制操纵,并调查复制可靠性,一个测量标准,即一个内容丰富的数据或恶意操纵,一个衡量当前市场稳健性模型的模型,一个我们用来衡量当前货币价值的模型,一个价格模型,一个我们用来衡量当前市场中稳健性模型的模型的复制性模型。