Federated learning is used for decentralized training of machine learning models on a large number (millions) of edge mobile devices. It is challenging because mobile devices often have limited communication bandwidth and local computation resources. Therefore, improving the efficiency of federated learning is critical for scalability and usability. In this paper, we propose to leverage partially trainable neural networks, which freeze a portion of the model parameters during the entire training process, to reduce the communication cost with little implications on model performance. Through extensive experiments, we empirically show that Federated learning of Partially Trainable neural networks (FedPT) can result in superior communication-accuracy trade-offs, with up to $46\times$ reduction in communication cost, at a small accuracy cost. Our approach also enables faster training, with a smaller memory footprint, and better utility for strong differential privacy guarantees. The proposed FedPT method can be particularly interesting for pushing the limitations of overparameterization in on-device learning.
翻译:联邦学习用于对大量(百万)边缘移动设备进行机器学习模型的分散化培训,具有挑战性,因为移动设备往往具有有限的通信带宽和本地计算资源。因此,提高联合学习的效率对于可扩缩和可用性至关重要。在本文中,我们提议利用部分可训练的神经网络,在整个培训过程中冻结部分模型参数,以减少通信成本,对模型性能影响不大。通过广泛的实验,我们从经验上表明,联邦学习部分可训练神经网络(FedPT)可以带来较高的通信准确性交易,通信成本降低46美元,成本小一些。我们的方法还有助于更快的培训,减少记忆足迹,并为差异很大的隐私保障提供更好的效用。 拟议的FedPT方法对于在设计性学习中推动超分计的限制可能特别有趣。