A fundamental issue for federated learning (FL) is how to achieve optimal model performance under highly dynamic communication environments. This issue can be alleviated by the fact that modern edge devices usually can connect to the edge FL server via multiple communication channels (e.g., 4G, LTE and 5G). However, having an edge device send copies of local models to the FL server along multiple channels is redundant, time-consuming, and would waste resources (e.g., bandwidth, battery life and monetary cost). In this paper, motivated by the layered coding techniques in video streaming, we propose a novel FL framework called layered gradient compression (LGC). Specifically, in LGC, local gradients from a device is coded into several layers and each layer is sent to the FL server along a different channel. The FL server aggregates the received layers of local gradients from devices to update the global model, and sends the result back to the devices. We prove the convergence of LGC, and formally define the problem of resource-efficient federated learning with LGC. We then propose a learning based algorithm for each device to dynamically adjust its local computation (i.e., the number of local stochastic descent) and communication decisions (i.e.,the compression level of different layers and the layer to channel mapping) in each iteration. Results from extensive experiments show that using our algorithm, LGC significantly reduces the training time, improves the resource utilization, while achieving a similar accuracy, compared with well-known FL mechanisms.
翻译:联邦化学习(FL)的根本问题是,如何在高度动态的通信环境中实现最佳模型性能。由于现代边缘设备通常可以通过多个通信渠道(如4G、LTE和5G)与边缘FL服务器连接,因此这一问题可以缓解。然而,如果有一个边缘设备通过多个渠道(如4G、LTE和5G)向FL服务器发送本地模型副本,则该设备将本地模型副本发送到FL服务器,这是多余、耗时和浪费资源(如带宽、电池寿命和货币成本)。在本文中,由于视频流流的分层化编码技术,我们提议了一个名为分层梯度压缩(LGC)的新FL框架。具体而言,在LGC中,一个设备本地梯度的梯度通常可以编码为几个层次,每个层次被发送到FL服务器。FL服务器综合从设备接收到的本地梯度层层,以更新全球模型,并将结果发回装置。我们证明了LGC的趋同,并正式界定了与LGC进行资源效率化的学习问题。我们随后建议为每个设备的每个装置提供一种基于精度的算法的计算,然后将每个设备进行精细的计算,从地方级进行地方级的计算,从地方级的顺序到地方级的计算。