Pre-training is prevalent in today's deep learning as a way to improve the performance of learned models. In the federated learning (FL) literature, however, neural networks are mostly initialized with random weights. This contrast motivated us to conduct a systematic study of pre-training for FL. Across multiple visual recognition benchmarks, we found that pre-training not only improves FL, but also closes its accuracy gap to centralized learning, especially in the challenging cases of non-IID client data. To make our findings applicable to situations where pre-trained models are not directly available, we explored pre-training with synthetic data, or even with clients' own data in a decentralized manner, and found that both can already improve FL notably. Interestingly, many of the techniques we explored are complementary and can be combined to further boost performance, which we view as a critical result toward scaling up deep FL for real-world applications. We conclude the paper with an attempt to understand the effect of pre-training on FL. We found that pre-training enables the global models learned under different client data conditions to converge to the same loss basin, and makes global aggregation in FL more stable. Nevertheless, pre-training does not seem to alleviate local model drift, a fundamental problem in FL under non-IID data.
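To make the setup concrete, the sketch below illustrates one FedAvg-style round in which the global model is initialized either randomly or from pre-trained weights. It is a minimal illustration, not the paper's exact configuration: the ResNet-18 backbone, ImageNet weights, 10-class head, client data loaders, and hyperparameters are all assumptions made for the example.

```python
# Minimal FedAvg-style sketch: pre-trained vs. random global initialization.
# Backbone, weights, and hyperparameters are illustrative assumptions.
import copy
import torch
import torch.nn as nn
import torchvision


def make_global_model(pretrained: bool) -> nn.Module:
    # ResNet-18 backbone; pretrained=True loads ImageNet weights
    # (the paper's pre-training sources may differ, e.g., synthetic data).
    weights = torchvision.models.ResNet18_Weights.DEFAULT if pretrained else None
    model = torchvision.models.resnet18(weights=weights)
    model.fc = nn.Linear(model.fc.in_features, 10)  # e.g., a 10-class benchmark
    return model


def local_update(global_model, loader, epochs=1, lr=0.01):
    # One client's local SGD pass, starting from the current global weights.
    model = copy.deepcopy(global_model)
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
    return model.state_dict()


def fedavg_round(global_model, client_loaders):
    # Global aggregation: element-wise average of client models
    # (uniform client weights for simplicity).
    client_states = [local_update(global_model, dl) for dl in client_loaders]
    avg_state = {
        k: torch.stack([s[k].float() for s in client_states]).mean(dim=0)
        for k in client_states[0]
    }
    global_model.load_state_dict(avg_state)
    return global_model
```

A typical run would call `make_global_model(pretrained=True)` once, then apply `fedavg_round` for a number of communication rounds over the (possibly non-IID) client loaders; switching `pretrained` to `False` gives the random-initialization baseline that the paper compares against.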