木屋:实用、私营和可扩展的联邦学习 (Papaya: Practical, Private, and Scalable Federated Learning)

Dzmitry Huba,John Nguyen,Kshitiz Malik,Ruiyu Zhu,Mike Rabbat,Ashkan Yousefpour,Carole-Jean Wu,Hongyuan Zhan,Pavel Ustinov,Harish Srinivas,Kaikai Wang,Anthony Shoumikhin,Jesik Min,Mani Malek

Cross-device Federated Learning (FL) is a distributed learning paradigm with several challenges that differentiate it from traditional distributed learning, variability in the system characteristics on each device, and millions of clients coordinating with a central server being primary ones. Most FL systems described in the literature are synchronous - they perform a synchronized aggregation of model updates from individual clients. Scaling synchronous FL is challenging since increasing the number of clients training in parallel leads to diminishing returns in training speed, analogous to large-batch training. Moreover, stragglers hinder synchronous FL training. In this work, we outline a production asynchronous FL system design. Our work tackles the aforementioned issues, sketches of some of the system design challenges and their solutions, and touches upon principles that emerged from building a production FL system for millions of clients. Empirically, we demonstrate that asynchronous FL converges faster than synchronous FL when training across nearly one hundred million devices. In particular, in high concurrency settings, asynchronous FL is 5x faster and has nearly 8x less communication overhead than synchronous FL.

翻译：跨联邦学习(FL)是一种分布式的学习模式,它与传统分布式学习不同,每个设备系统特性的变异性,以及数百万客户与中央服务器协调是主要服务器,文献中描述的大多数FL系统是同步的,它们同步地汇总了个别客户的模型更新。 Slap 同步FL具有挑战性,因为同步的FL同时增加客户培训的数量会减少培训速度的回报,类似于大型批量培训。此外,挤压器会阻碍同步FL培训。在这项工作中,我们概述了一种不同步的FL系统设计。我们的工作解决了上述问题,绘制了一些系统设计挑战及其解决方案的草图,并触及了为数百万客户建立FL生产系统时产生的原则。我们很生动地表明,在近1亿个设备的培训中,不同步的FL比同步的FL同步速度要快。特别是在高通货币环境中,由于同步FL速度快5x快,通信距离近8x高。

相关内容

联邦学习

关注 199

联邦学习（Federated Learning）是一种新兴的人工智能基础技术，在 2016 年由谷歌最先提出，原本用于解决安卓手机终端用户在本地更新模型的问题，其设计目标是在保障大数据交换时的信息安全、保护终端数据和个人数据隐私、保证合法合规的前提下，在多参与方或多计算结点之间开展高效率的机器学习。其中，联邦学习可使用的机器学习算法不局限于神经网络，还包括随机森林等重要算法。联邦学习有望成为下一代人工智能协同算法和协作网络的基础。

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日