In Federated Learning (FL), a strong global model is collaboratively learned by aggregating the clients' locally trained models. Although this removes the need to access clients' data directly, the global model's convergence often suffers from data heterogeneity. This paper suggests that forgetting could be the bottleneck of global convergence. We observe that fitting on a biased local distribution shifts the features learned on the global distribution and results in forgetting of global knowledge. We view this phenomenon as analogous to Continual Learning, which also faces catastrophic forgetting when a model is fitted on a new task distribution. Based on our findings, we hypothesize that tackling forgetting in local training relieves the data heterogeneity problem. To this end, we propose a simple yet effective framework, Federated Local Self-Distillation (FedLSD), which utilizes the global model's knowledge on locally available data. By following the global perspective on local data, FedLSD encourages the learned features to preserve global knowledge and to stay consistent across local models, thus improving convergence without compromising data privacy. Under our framework, we further extend FedLSD to FedLS-NTD, which considers only the not-true class signals to compensate for the noisy predictions of the global model. We validate that both FedLSD and FedLS-NTD significantly improve performance on standard FL benchmarks in various setups, especially under extreme data heterogeneity.
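The two objectives described above can be sketched as local training losses. This is a minimal NumPy illustration, not the authors' implementation: FedLSD adds a distillation term matching the local model's softened predictions to the frozen global model's predictions on the same local batch, and FedLS-NTD applies the same distillation after dropping the true-class logit so only not-true class signals are matched. The weighting `beta` and temperature `tau` are illustrative hyperparameter names.

```python
import numpy as np

def softmax(z):
    # numerically stable row-wise softmax
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(logits, targets):
    p = softmax(logits)
    return -np.log(p[np.arange(len(targets)), targets]).mean()

def kl(p, q):
    # mean row-wise KL(p || q); epsilon guards log(0)
    eps = 1e-12
    return (p * (np.log(p + eps) - np.log(q + eps))).sum(axis=1).mean()

def fedlsd_loss(local_logits, global_logits, targets, beta=0.5, tau=3.0):
    # cross-entropy on local labels + distillation toward the global model
    ce = cross_entropy(local_logits, targets)
    kd = kl(softmax(global_logits / tau), softmax(local_logits / tau)) * tau**2
    return (1 - beta) * ce + beta * kd

def fedls_ntd_loss(local_logits, global_logits, targets, beta=0.5, tau=3.0):
    # distill only the not-true classes: remove the true-class logit from
    # both models before renormalizing, so a noisy global prediction on the
    # true class cannot dominate the distillation signal
    n, c = local_logits.shape
    keep = np.ones((n, c), dtype=bool)
    keep[np.arange(n), targets] = False
    nt_local = local_logits[keep].reshape(n, c - 1)
    nt_global = global_logits[keep].reshape(n, c - 1)
    ce = cross_entropy(local_logits, targets)
    kd = kl(softmax(nt_global / tau), softmax(nt_local / tau)) * tau**2
    return (1 - beta) * ce + beta * kd
```

In practice the global logits would come from a frozen copy of the server model evaluated on the client's batch; the distillation term vanishes when the local and global models agree.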