The limited communication resources, e.g., bandwidth and energy, and data heterogeneity across devices are two of the main bottlenecks for federated learning (FL). To tackle these challenges, we first devise a novel FL framework with partial model aggregation (PMA), which only aggregates the lower layers of neural networks responsible for feature extraction while the upper layers corresponding to complex pattern recognition remain at devices for personalization. The proposed PMA-FL is able to address the data heterogeneity and reduce the transmitted information in wireless channels. We then obtain a convergence bound of the framework under a non-convex loss function setting. With the aid of this bound, we define a new objective function, named the scheduled data sample volume, to transfer the original inexplicit optimization problem into a tractable one for device scheduling, bandwidth allocation, computation and communication time division. Our analysis reveals that the optimal time division is achieved when the communication and computation parts of PMA-FL have the same power. We also develop a bisection method to solve the optimal bandwidth allocation policy and use the set expansion algorithm to address the optimal device scheduling. Compared with the state-of-the-art benchmarks, the proposed PMA-FL improves 2.72% and 11.6% accuracy on two typical heterogeneous datasets, i.e., MINIST and CIFAR-10, respectively. In addition, the proposed joint dynamic device scheduling and resource optimization approach achieve slightly higher accuracy than the considered benchmarks, but they provide a satisfactory energy and time reduction: 29% energy or 20% time reduction on the MNIST; and 25% energy or 12.5% time reduction on the CIFAR-10.
翻译:有限的通信资源,例如带宽和能源,以及各种设备的数据异质性,是联合学习(FL)的两个主要瓶颈。为了应对这些挑战,我们首先设计了一个新的FL框架,其中含有部分模型汇总(PMA),我们首先设计了一个新的FL框架,其中仅汇总了负责地貌提取的神经网络较低层,而与复杂模式识别相对应的上层则仍然留在个性化装置中。拟议的PMA-FL能够解决数据异质性,减少无线频道传送的信息。我们随后在非convex损失功能设置下,我们获得了框架的趋同。我们在这一约束的帮助下,我们定义了一个新的目标功能,并命名了预定的数据样本数量,将最初的不完全优化问题转换成一个可移动的用于设备调度、带宽分配、计算和通信时间分化的系统。我们的分析表明,当PMA-FL部分的通信和计算具有同样能力时,最佳的带宽分配方法可以解决最优化的带宽分配政策,并使用设定的扩展算法来应对最佳的装置的更精确的时间安排。在2级中,比IML-72或IMLIMLIMA的节中,分别地实现了降低成本。在IM-IM-10-10的进度上,分别地,在2级中,降低了的节能-IM-IM-时间轴上分别地改进了成本-IM-IM-IM-时间轴上,改进了2的节节节制,降低了。