In this paper, we are interested in what we term the federated private bandits framework, that combines differential privacy with multi-agent bandit learning. We explore how differential privacy based Upper Confidence Bound (UCB) methods can be applied to multi-agent environments, and in particular to federated learning environments both in `master-worker' and `fully decentralized' settings. We provide a theoretical analysis on the privacy and regret performance of the proposed methods and explore the tradeoffs between these two.
翻译:在本文中,我们感兴趣的是我们所谓的联合私人强盗框架,它把不同的隐私与多剂强盗学习结合起来。我们探讨了基于隐私的差别性高信任圈(UBB)方法如何适用于多剂环境,特别是适用于“总经理-工人”和“完全分散”环境中的联合学习环境。我们从理论上分析了拟议方法的隐私和遗憾表现,并探讨了这两种方法之间的取舍。