Multiple medical institutions collaboratively training a model using federated learning (FL) has become a promising solution for maximizing the potential of data-driven models, yet the non-independent and identically distributed (non-iid) data in medical images is still an outstanding challenge in real-world practice. The feature heterogeneity caused by diverse scanners or protocols introduces a drift in the learning process, in both local (client) and global (server) optimizations, which harms the convergence as well as model performance. Many previous works have attempted to address the non-iid issue by tackling the drift locally or globally, but how to jointly solve the two essentially coupled drifts is still unclear. In this work, we concentrate on handling both local and global drifts and introduce a new harmonizing framework called HarmoFL. First, we propose to mitigate the local update drift by normalizing amplitudes of images transformed into the frequency domain to mimic a unified imaging setting, in order to generate a harmonized feature space across local clients. Second, based on harmonized features, we design a client weight perturbation guiding each local model to reach a flat optimum, where a neighborhood area of the local optimal solution has a uniformly low loss. Without any extra communication cost, the perturbation assists the global model to optimize towards a converged optimal solution by aggregating several local flat optima. We have theoretically analyzed the proposed method and empirically conducted extensive experiments on three medical image classification and segmentation tasks, showing that HarmoFL outperforms a set of recent state-of-the-art methods with promising convergence behavior.
翻译:多个医疗机构合作培训了使用联合学习(FL)模式的模式,这已成为最大限度地发挥数据驱动模型潜力的一个有希望的解决办法,但医疗图像中不独立和同样分布的数据(非二d)仍然是现实世界实践中的一个突出挑战。 不同扫描仪或协议造成的特征差异化在本地(客户)和全球(服务器)优化的学习过程中造成了一种漂移,从而损害趋同和模型性能。许多以前的工作都试图通过处理本地或全球漂移,解决非二期问题,但如何共同解决两种基本上并存的流动,目前还不清楚。在这项工作中,我们集中处理本地和全球的漂移问题,并引入一个叫做“哈洛夫勒”的新的统一框架。首先,我们建议通过将图像的正常振荡度转换到频率域以模拟统一成一个统一的图像设置,从而在本地客户之间形成一个统一的地貌空间。 其次,根据统一的国家特征,我们设计一个客户权重点,用以指导每个本地模型达到一个基本相交错流的流流流流流。 我们集中处理本地的模型,一个平定最佳的平时程,一个最优化的路径,一个最优化的路径,一个最优化的平整的路径,一个最优化的平平整的平整的平整的平整的平整的平整的平整的平整的平整的平整的平整的平整的平整。