Federated Learning (FL) is a distributed machine learning paradigm in which clients collaboratively train a model using their local (human-generated) datasets. While existing studies focus on developing FL algorithms that tackle data heterogeneity across clients, the important issue of data quality in FL (e.g., label noise) has been overlooked. This paper aims to fill this gap by providing a quantitative study of the impact of label noise on FL. We derive an upper bound on the generalization error that is linear in the clients' label noise level. We then conduct experiments on the MNIST and CIFAR-10 datasets using various FL algorithms. Our empirical results show that the global model accuracy decreases linearly as the noise level increases, which is consistent with our theoretical analysis. We further find that label noise slows down the convergence of FL training and that the global model tends to overfit when the noise level is high.
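For concreteness, the sketch below illustrates one common way a per-client label noise level can be simulated in such experiments; it is not taken from the paper, and the symmetric label-flipping model, function names, and noise levels used here are assumptions for illustration only.

```python
import numpy as np

def add_symmetric_label_noise(labels, noise_level, num_classes, rng=None):
    """Flip each label to a uniformly random *other* class with probability `noise_level`.

    Illustrative sketch of a symmetric noise model; the paper may use a
    different noise model (e.g., pair flipping or class-dependent noise).
    """
    rng = np.random.default_rng() if rng is None else rng
    labels = np.asarray(labels).copy()
    flip_mask = rng.random(labels.shape[0]) < noise_level
    # Shift flipped labels by a random nonzero offset so the new class differs from the original.
    random_offsets = rng.integers(1, num_classes, size=int(flip_mask.sum()))
    labels[flip_mask] = (labels[flip_mask] + random_offsets) % num_classes
    return labels

# Hypothetical usage: assign each simulated client its own noise level before local training.
client_noise_levels = [0.0, 0.2, 0.4]
clean_labels = np.random.default_rng(0).integers(0, 10, size=100)  # 10 classes, as in MNIST/CIFAR-10
noisy_labels_per_client = [
    add_symmetric_label_noise(clean_labels, eps, num_classes=10)
    for eps in client_noise_levels
]
```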