In this paper, we study the tradeoffs between time-speedup and the number of communication rounds of the learning process in the collaborative learning model on non-IID data, where multiple agents interact with possibly different environments and they want to learn an objective in the aggregated environment. We use a basic problem in bandit theory called best arm identification in multi-armed bandits as a vehicle to deliver the following conceptual message: Collaborative learning on non-IID data is provably more difficult than that on IID data. In particular, we show the following: a) The speedup in the non-IID data setting can be less than $1$ (that is, a slowdown). When the number of rounds $R = O(1)$, we will need at least a polynomial number of agents (in terms of the number of arms) to achieve a speedup greater than $1$. This is in sharp contrast with the IID data setting, in which the speedup is always at least $1$ when $R \ge 2$ regardless of number of agents. b) Adaptivity in the learning process cannot help much in the non-IID data setting. This is in sharp contrast with the IID data setting, in which to achieve the same speedup, the best non-adaptive algorithm requires a significantly larger number of rounds than the best adaptive algorithm. In the technique space, we have further developed the generalized round elimination technique introduced in arXiv:1904.03293. We show that implicit representations of distribution classes can be very useful when working with complex hard input distributions and proving lower bounds directly for adaptive algorithms.
翻译:在本文中,我们研究了非IID数据合作学习模式中学习过程的时间加速与交流周期数目之间的权衡,在这种过程中,多个代理商与可能不同的环境相互作用,他们希望在综合环境中学习一个目标。我们使用一个称为多武装匪徒最佳手臂识别的土匪理论的基本问题,作为传递以下概念信息的工具:合作学习非IID数据比ID数据难得多。特别是,我们显示以下各点:(a) 非IID数据设置的加速速度可能低于1美元(即减速)。当多个代理商与可能不同的环境发生互动,他们希望在综合环境中学习一个目标。我们使用一个叫作多武装匪徒最佳手臂识别的强盗理论,作为传递以下概念信息的工具:合作学习非IID数据学习过程的加速速度总是至少为1美元,而不论代理商的数量为何,在非IID数据设置的直流速度上,这与我们最难的递增速度相比,这种速度与最慢的递增速度相比,在ID数据排序中,这与最大幅度的递增的递算法是:我们更难的推算。