Federated learning enables multiple institutions to collaboratively train machine learning models on their local data in a privacy-preserving manner. However, its distributed nature often leads to significant heterogeneity in data distributions across institutions. In this paper, we investigate the deleterious impact of a taxonomy of data heterogeneity regimes on federated learning methods, including quantity skew, label distribution skew, and imaging acquisition skew. We show that model performance degrades as the degree of data heterogeneity increases. We present several mitigation strategies to overcome the resulting performance drops, including weighted averaging for quantity skew, and weighted loss and batch normalization averaging for label distribution skew. The proposed optimizations improve the ability of federated learning methods to handle heterogeneity across institutions, providing valuable guidance for deploying federated learning in real clinical applications.
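As a concrete illustration of the quantity-skew mitigation mentioned above, the sketch below shows FedAvg-style aggregation in which each institution's model parameters are weighted by its local sample count, so that data-rich sites are not diluted by a plain unweighted mean. This is a minimal sketch under assumed names (aggregate_weighted, client_states, client_sizes) and an assumed PyTorch setup; it is not the paper's implementation.

```python
# Minimal sketch: sample-count-weighted aggregation of client model parameters
# (FedAvg-style), as a mitigation for data quantity skew.
# All names here are illustrative assumptions, not the paper's code.

from typing import Dict, List
import torch


def aggregate_weighted(client_states: List[Dict[str, torch.Tensor]],
                       client_sizes: List[int]) -> Dict[str, torch.Tensor]:
    """Average client parameters, weighting each client by its local sample count."""
    total = float(sum(client_sizes))
    weights = [n / total for n in client_sizes]

    global_state: Dict[str, torch.Tensor] = {}
    for key in client_states[0]:
        # Weighted sum of the same parameter tensor across all clients.
        global_state[key] = sum(w * state[key].float()
                                for w, state in zip(weights, client_states))
    return global_state


if __name__ == "__main__":
    # Two toy "clients" with very different data quantities (quantity skew).
    model_a = {"linear.weight": torch.ones(2, 2), "linear.bias": torch.zeros(2)}
    model_b = {"linear.weight": 3 * torch.ones(2, 2), "linear.bias": torch.ones(2)}
    merged = aggregate_weighted([model_a, model_b], client_sizes=[900, 100])
    print(merged["linear.weight"])  # pulled toward client A, which holds 9x more data
```

Weighting by sample count keeps the aggregated model from being skewed toward small institutions; analogous reweighting at the loss level (and separate handling of batch normalization statistics) is the corresponding mitigation sketched for label distribution skew.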