Federated learning (FL) allows agents to jointly train a global model without sharing their local data. However, due to the heterogeneous nature of local data, it is challenging to optimize or even define fairness of the trained global model for the agents. For instance, existing work usually considers accuracy equity as fairness for different agents in FL, which is limited, especially under the heterogeneous setting, since it is intuitively "unfair" to enforce agents with high-quality data to achieve similar accuracy to those who contribute low-quality data, which may discourage the agents from participating in FL. In this work, we propose a formal FL fairness definition, fairness via agent-awareness (FAA), which takes different contributions of heterogeneous agents into account. Under FAA, the performance of agents with high-quality data will not be sacrificed just due to the existence of large amounts of agents with low-quality data. In addition, we propose a fair FL training algorithm based on agent clustering (FOCUS) to achieve fairness in FL measured by FAA. Theoretically, we prove the convergence and optimality of FOCUS under mild conditions for linear and general convex loss functions with bounded smoothness. We also prove that FOCUS always achieves higher fairness in terms of FAA compared with standard FedAvg under both linear and general convex loss functions. Empirically, we show that on four FL datasets, including synthetic data, images, and texts, FOCUS achieves significantly higher fairness in terms of FAA while maintaining competitive prediction accuracy compared with FedAvg and state-of-the-art fair FL algorithms.
翻译:联邦学习(FL)允许代理商在不分享当地数据的情况下联合培训一个全球模型,然而,由于当地数据的性质不同,优化甚至界定经过培训的全球模型对代理商的公平性具有挑战性,例如,现有工作通常认为准确性是FL中不同代理商的公平性,而这种公平性是有限的,特别是在多样化环境下,因为执行具有高质量数据的代理商,以便实现与提供低质量数据的代理商相似的准确性,这可能会妨碍代理商参与FL的公平性。 然而,在这项工作中,我们提出了一个正式的FL公平性定义,即通过代理商认识实现公平性(FAA)的公平性(AA),将不同不同不同的代理商的贡献考虑在内。在FA中,具有高质量数据的大量代理商的绩效不会仅仅因为存在而遭到牺牲。 此外,我们提议基于代理商集群(FOCUS)的公平性培训算法,从理论上证明FOC在较轻的条件下保持FOC的一致性和优化性(FA)损失的公平性,同时证明我们始终在直线性与一般的准确性A的文本中都实现了公平性。