Federated learning (FL) has been increasingly considered to preserve data training privacy from eavesdropping attacks in mobile edge computing-based Internet of Thing (EdgeIoT). On the one hand, the learning accuracy of FL can be improved by selecting the IoT devices with large datasets for training, which gives rise to a higher energy consumption. On the other hand, the energy consumption can be reduced by selecting the IoT devices with small datasets for FL, resulting in a falling learning accuracy. In this paper, we formulate a new resource allocation problem for EdgeIoT to balance the learning accuracy of FL and the energy consumption of the IoT device. We propose a new federated learning-enabled twin-delayed deep deterministic policy gradient (FLDLT3) framework to achieve the optimal accuracy and energy balance in a continuous domain. Furthermore, long short term memory (LSTM) is leveraged in FL-DLT3 to predict the time-varying network state while FL-DLT3 is trained to select the IoT devices and allocate the transmit power. Numerical results demonstrate that the proposed FL-DLT3 achieves fast convergence (less than 100 iterations) while the FL accuracy-to-energy consumption ratio is improved by 51.8% compared to existing state-of-the-art benchmark.
 翻译:联邦学习(FL)越来越被考虑保护数据培训隐私,防止在移动边缘基于计算机的互联网(EdgeIoT)上偷听攻击事物(EdgeIoT),从而保护数据培训隐私。一方面,通过选择具有大型数据集的IoT设备进行培训,可以提高FL的学习准确性,从而增加能源消耗;另一方面,通过选择具有小数据集的IoT设备,可以减少能源消耗,从而降低学习准确性。在本文中,我们为EgeIoT制定了一个新的资源分配问题,以平衡FL的学习准确性和IoT装置的能源消耗。我们提出一个新的由FLT驱动的双延迟双向深确定性政策梯度框架(FLDLT3),以在连续域实现最佳准确性和能源平衡。此外,FL-D3中长期记忆(LTM)被利用来预测时间变异网络状态,而FL-D3则被训练为选择IOT装置和分配传输动力。我们建议采用新的Fmerical-LT(FLLT)的快速趋同率,而目前的Fmeral-LT则显示,而FILT比其快速趋同比率为50-LLT的精确比为快的精确度。