Federated learning (FedL) has emerged as a popular technique for distributing model training over a set of wireless devices, via iterative local updates (at devices) and global aggregations (at the server). In this paper, we develop \textit{parallel successive learning} (PSL), which expands the FedL architecture along three dimensions: (i) Network, allowing decentralized cooperation among the devices via device-to-device (D2D) communications. (ii) Heterogeneity, interpreted at three levels: (ii-a) Learning: PSL considers heterogeneous number of stochastic gradient descent iterations with different mini-batch sizes at the devices; (ii-b) Data: PSL presumes a dynamic environment with data arrival and departure, where the distributions of local datasets evolve over time, captured via a new metric for model/concept drift. (ii-c) Device: PSL considers devices with different computation and communication capabilities. (iii) Proximity, where devices have different distances to each other and the access point. PSL considers the realistic scenario where global aggregations are conducted with idle times in-between them for resource efficiency improvements, and incorporates data dispersion and model dispersion with local model condensation into FedL. Our analysis sheds light on the notion of cold vs. warmed up models, and model inertia in distributed machine learning. We then propose network-aware dynamic model tracking to optimize the model learning vs. resource efficiency tradeoff, which we show is an NP-hard signomial programming problem. We finally solve this problem through proposing a general optimization solver. Our numerical results reveal new findings on the interdependencies between the idle times in-between the global aggregations, model/concept drift, and D2D cooperation configuration.
翻译:联邦学习( FedL) 已经成为通过迭代本地更新( 设备上) 和全球聚合( 服务器上), 在一组无线设备上分配模式培训的流行技术。 在本文中, 我们开发了\ textit{ parllel 相继学习} ( PSL), 将FedL架构扩展为三个层面:(一) 网络, 允许通过设备到设备到设备( D2D) 通信在设备之间分散合作。 (二) 多样化, 解释为三个层面:(二) 学习: PSL 考虑具有不同小批量规模的随机梯度D下移数; (二) B) 数据: PSL 假设一个动态环境, 数据到达和离开, 本地数据集的分布会随着时间变化而变化, 通过新的模型到模型和通信能力( D2D) 。 (二) 设备在模型上考虑设备与其它设备有不同距离, 以及接入点。 PSLSL 设想现实的情景是, 将全球聚合和智能流流流流流流数据显示时间。