Cross-silo federated learning (FL) is a typical FL that enables organizations(e.g., financial or medical entities) to train global models on isolated data. Reasonable incentive is key to encouraging organizations to contribute data. However, existing works on incentivizing cross-silo FL lack consideration of the environmental dynamics (e.g., precision of the trained global model and data owned by uncertain clients during the training processes). Moreover, most of them assume that organizations share private information, which is unrealistic. To overcome these limitations, we propose a novel adaptive mechanism for cross-silo FL, towards incentivizing organizations to contribute data to maximize their long-term payoffs in a real dynamic training environment. The mechanism is based on multi-agent reinforcement learning, which learns near-optimal data contribution strategy from the history of potential games without organizations' private information. Experiments demonstrate that our mechanism achieves adaptive incentive and effectively improves the long-term payoffs for organizations.
翻译:跨部门联合学习(FL)是一个典型的FL,它使各组织(例如金融或医疗实体)能够就孤立数据培训全球模型。合理的奖励是鼓励各组织提供数据的关键。然而,现有的鼓励跨筒行动FL的工作缺乏对环境动态的考虑(例如,经过培训的全球模型的精确度和不确定客户在培训过程中拥有的数据)。此外,它们大多认为各组织共享私人信息是不现实的。为了克服这些限制,我们提议了一个新的跨筒行动FL适应机制,以激励各组织提供数据,在真正的动态培训环境中最大限度地扩大长期回报。该机制以多剂强化学习为基础,在没有组织私人信息的情况下从潜在游戏的历史中学习接近最佳的数据贡献战略。实验表明,我们的机制实现了适应性激励,并有效地改善了各组织的长期回报。