The practice of applying several local updates before aggregation across clients has been empirically shown to be a successful approach to overcoming the communication bottleneck in Federated Learning (FL). Such methods are usually implemented by having clients perform one or more epochs of local training per round while randomly reshuffling their finite dataset in each epoch. Data imbalance, where clients have different numbers of local training samples, is ubiquitous in FL applications, resulting in different clients performing different numbers of local updates in each round. In this work, we propose a general recipe, FedShuffle, that better utilizes the local updates in FL, especially in this regime encompassing random reshuffling and heterogeneity. FedShuffle is the first local update method with theoretical convergence guarantees that incorporates random reshuffling, data imbalance, and client sampling - features that are essential in large-scale cross-device FL. We present a comprehensive theoretical analysis of FedShuffle and show, both theoretically and empirically, that it does not suffer from the objective function mismatch that is present in FL methods that assume homogeneous updates in heterogeneous FL setups, such as FedAvg (McMahan et al., 2017). In addition, by combining the ingredients above, FedShuffle improves upon FedNova (Wang et al., 2020), which was previously proposed to solve this mismatch. Similar to Mime (Karimireddy et al., 2020), we show that FedShuffle with momentum variance reduction (Cutkosky & Orabona, 2019) improves upon non-local methods under a Hessian similarity assumption.
翻译:实践证明,在客户汇总之前应用若干本地更新的做法是克服联邦学习联合会(FL)沟通瓶颈的成功方法。这种方法通常通过客户每轮进行一个或多个地方培训,同时随机调整其有限的数据集。在FL应用程序中,客户拥有不同数量的地方培训样本的数据不平衡无处不在,导致不同的客户在每轮地方更新数量不同。在这项工作中,我们提议采用一个通配方FedShuffle,以更好地利用FL的本地更新,特别是包括随机调整和异质性在内的制度。FedShuffle是第一个本地更新方法,其理论整合保证包含随机调整、数据不平衡和客户抽样等要素。在FL应用程序中,数据失衡导致不同的客户在每轮地方更新数量不同。我们提出了FedShuffle(FedShuffle)的全面理论分析,在理论上和经验上都表明,它并不因FLL的客观功能不匹配而受到影响,特别是在这个制度中,包括随机调整、数据不平衡和客户抽样,在FedS & Al-M 上显示,这种标准化的更新是FM-M的。