Although federated learning (FL) has recently been proposed for efficient distributed training and data privacy protection, it still encounters many obstacles. One of these is the naturally existing statistical heterogeneity among clients, making local data distributions non independently and identically distributed (i.e., non-iid), which poses challenges for model aggregation and personalization. For FL with a deep neural network (DNN), privatizing some layers is a simple yet effective solution for non-iid problems. However, which layers should we privatize to facilitate the learning process? Do different categories of non-iid scenes have preferred privatization ways? Can we automatically learn the most appropriate privatization way during FL? In this paper, we answer these questions via abundant experimental studies on several FL benchmarks. First, we present the detailed statistics of these benchmarks and categorize them into covariate and label shift non-iid scenes. Then, we investigate both coarse-grained and fine-grained network splits and explore whether the preferred privatization ways have any potential relations to the specific category of a non-iid scene. Our findings are exciting, e.g., privatizing the base layers could boost the performances even in label shift non-iid scenes, which are inconsistent with some natural conjectures. We also find that none of these privatization ways could improve the performances on the Shakespeare benchmark, and we guess that Shakespeare may not be a seriously non-iid scene. Finally, we propose several approaches to automatically learn where to aggregate via cross-stitch, soft attention, and hard selection. We advocate the proposed methods could serve as a preliminary try to explore where to privatize for a novel non-iid scene.
翻译:虽然最近提议采用联合学习(FL)来有效分发培训和数据隐私保护,但它仍然遇到许多障碍,其中之一是客户之间自然存在的统计差异性,使得当地数据分布不独立和完全分布(即非二d),给模型集成和个人化提出了挑战。对于具有深层神经网络(DNN)的FL, 将一些层次私有化是解决非二类问题的简单而有效的解决办法。然而,我们应该将哪些层次私有化以促进学习进程?不同类别的非二类私有化是否更倾向于私有化方式?在FL期间,我们能否自动地学习最适当的私有化方式?在本文件中,我们通过大量实验研究来回答这些问题(即非二类)。首先,我们提出这些基准的详细统计数据,并将其分类为隐蔽和标签变换非二类的场景。然后,我们调查一些粗略和精细的网络的分解,并探讨首类私有化方式是否与非二类特定类别的非二类私有化方式有潜在关系,我们甚至可以提出非二类私有化方式。我们的结论并非要改变,例如,而不能改变。