In federated learning (FL), classifiers (e.g., deep networks) are trained on datasets from multiple centers without exchanging data across them, and thus improves sample efficiency. In the classical setting of FL, the same labeling criterion is usually employed across all centers being involved in training. This constraint greatly limits the applicability of FL. For example, standards used for disease diagnosis are more likely to be different across clinical centers, which mismatches the classical FL setting. In this paper, we consider an important yet under-explored setting of FL, namely FL with mixed-type labels where different labeling criteria can be employed by various centers, leading to inter-center label space differences and challenging existing FL methods designed for the classical setting. To effectively and efficiently train models with mixed-type labels, we propose a theory-guided and model-agnostic approach that can make use of the underlying correspondence between those label spaces and can be easily combined with various FL methods such as FedAvg. We present convergence analysis based on over-parameterized ReLU networks. We show that the proposed method can achieve linear convergence in label projection, and demonstrate the impact of the parameters of our new setting on the convergence rate. The proposed method is evaluated and the theoretical findings are validated on benchmark and medical datasets.
翻译:在联合学习(FL)中,分类人员(例如深网络)在多个中心进行数据集培训,而不相互交换数据,从而提高抽样效率。在传统FL环境中,参与培训的所有中心通常都采用同样的标签标准。这种限制极大地限制了FL的适用性。例如,用于疾病诊断的标准在临床中心之间更有可能不同,这与传统的FL设置不匹配。在本文中,我们认为FL是一个重要但尚未开发的FL设置,即具有混合型标签,各中心可以采用不同的标签标准,导致中心间标签空间差异,并对为传统设置设计的现有FL方法提出挑战。为了有效和高效地培训混合型标签模式模型,我们建议采用理论指南和模型-诺知性方法,利用这些标签空间之间的基本对应关系,并容易与FedAvg等各种FL方法相结合。我们根据多度的分类类型标签标签进行趋同分析,其中不同标签标准可以由不同中心使用,导致中心之间的空间空间差异,并对为传统设置的现有FLU提供不同的标签方法提出挑战。我们提议的方法可以实现线性趋同率率,我们提出的指标的预测和标准是:关于我们提出的指标的标准化指标的标准化分析,用以显示我们的基准分析。