Incentive mechanism is crucial for federated learning (FL) when rational clients do not have the same interests in the global model as the server. However, due to system heterogeneity and limited budget, it is generally impractical for the server to incentivize all clients to participate in all training rounds (known as full participation). The existing FL incentive mechanisms are typically designed by stimulating a fixed subset of clients based on their data quantity or system resources. Hence, FL is performed only using this subset of clients throughout the entire training process, leading to a biased model because of data heterogeneity. This paper proposes a game theoretic incentive mechanism for FL with randomized client participation, where the server adopts a customized pricing strategy that motivates different clients to join with different participation levels (probabilities) for obtaining an unbiased and high performance model. Each client responds to the server's monetary incentive by choosing its best participation level, to maximize its profit based on not only the incurred local cost but also its intrinsic value for the global model. To effectively evaluate clients' contribution to the model performance, we derive a new convergence bound which analytically predicts how clients' arbitrary participation levels and their heterogeneous data affect the model performance. By solving a non-convex optimization problem, our analysis reveals that the intrinsic value leads to the interesting possibility of bidirectional payment between the server and clients. Experimental results using real datasets on a hardware prototype demonstrate the superiority of our mechanism in achieving higher model performance for the server as well as higher profits for the clients.
翻译:在对于合理客户与服务器在全局模型上有不同利益的联邦学习中,激励机制至关重要。然而,由于系统异构性和预算限制,通常情况下服务器不能激励所有客户在所有训练轮次中参与训练(称为完全参与)。现有的联邦学习激励机制通常基于客户数据量或系统资源来激励固定数量的客户,因此整个训练过程只使用此客户子集进行联邦学习,导致由于数据异构性而产生的偏见模型。本文提出了一种基于随机客户参与的博弈论激励机制,其中服务器采用定制的定价策略,以获得不偏和高性能模型来激励不同的客户以不同的参与水平(概率)加入学习。每个客户对服务器的货币激励作出反应,通过选择其最佳参与水平来最大化其利润,该利润基于它产生的局部成本和其对全局模型的内在价值。为了有效地评估客户对模型性能的贡献,我们推导出一个新的收敛性边界,该边界分析预测了客户的任意参与水平及其异构数据如何影响模型性能。通过解决一个非凸优化问题,我们的分析揭示了内在价值导致双向支付的有趣可能性。使用真实数据集在硬件原型上进行的实验结果表明,我们的机制在为服务器实现更高的模型性能和为客户提高更高的利润方面具有优越性。