Federated learning (FL) is an emerging distributed machine learning paradigm that stands out with its inherent privacy-preserving advantages. Heterogeneity is one of the core challenges in FL, which resides in the diverse user behaviors and hardware capacity across devices who participate in the training. Heterogeneity inherently exerts a huge influence on the FL training process, e.g., causing device unavailability. However, existing FL literature usually ignores the impacts of heterogeneity. To fill in the knowledge gap, we build FLASH, the first heterogeneity-aware FL platform. Based on FLASH and a large-scale user trace from 136k real-world users, we demonstrate the usefulness of FLASH in anatomizing the impacts of heterogeneity in FL by exploring three previously unaddressed research questions: whether and how can heterogeneity affect FL performance; how to configure a heterogeneity-aware FL system; and what are heterogeneity's impacts on existing FL optimizations. It shows that heterogeneity causes nontrivial performance degradation in FL from various aspects, and even invalidates some typical FL optimizations.
翻译:联邦学习(FL)是新兴的分布式机器学习模式,其内在的隐私保护优势突出。异质性是FL的核心挑战之一,它存在于参与培训的各种装置的不同用户行为和硬件能力中。异质性对FL培训过程具有巨大影响,例如造成设备无法使用。然而,现有的FL文献通常忽视异质性的影响。为了填补知识空白,我们建立了FLASH,这是第一个异质性能异性FL平台。基于FLASH和来自136k现实世界用户的大规模用户跟踪,我们通过探讨三个以前未解决的研究问题,展示了FLASH在解剖异性影响方面的益处:异性性性能是否以及如何影响FL的性能;如何配置异性性性能觉系统;以及异性性性对现有FL优化的影响。这甚至表明,异性性性性性性能导致FL的典型性能退化。