Online federated learning (FL) enables geographically distributed devices to learn a global shared model from locally available streaming data. Most online FL literature considers a best-case scenario regarding the participating clients and the communication channels. However, these assumptions are often not met in real-world applications. Asynchronous settings can reflect a more realistic environment, such as heterogeneous client participation due to available computational power and battery constraints, as well as delays caused by communication channels or straggler devices. Further, in most applications, energy efficiency must be taken into consideration. Using the principles of partial-sharing-based communications, we propose a communication-efficient asynchronous online federated learning (PAO-Fed) strategy. By reducing the communication overhead of the participants, the proposed method renders participation in the learning task more accessible and efficient. In addition, the proposed aggregation mechanism accounts for random participation, handles delayed updates and mitigates their effect on accuracy. We prove the first and second-order convergence of the proposed PAO-Fed method and obtain an expression for its steady-state mean square deviation. Finally, we conduct comprehensive simulations to study the performance of the proposed method on both synthetic and real-life datasets. The simulations reveal that in asynchronous settings, the proposed PAO-Fed is able to achieve the same convergence properties as that of the online federated stochastic gradient while reducing the communication overhead by 98 percent.
翻译:在线联邦学习可以使地理位置分布的设备从本地可用的流数据中学习全局共享模型。大多数在线联邦学习文献都是在考虑参与客户和通信渠道最佳情况下进行的。然而,在现实世界的应用程序中,这些假设通常不成立。异步设置更能反映实际环境,如由于计算能力和电池限制导致的异构客户参与以及由通信渠道或落后设备导致的延迟等。此外,在大多数应用程序中,必须考虑能源效率。使用基于部分共享的通信原理,提出了一种通信高效的异步在线联邦学习(PAO-Fed)策略。通过减少参与者的通信开销,所提出的方法使参与学习任务变得更加容易和高效。此外,所提出的聚合机制能应对随机参与、处理延迟的更新并减轻其对准确性的影响。证明了所提出的PAO-Fed方法的一阶和二阶收敛性,并得出了其稳态均方偏差的表达式。最后,我们进行了综合仿真,研究了所提出的方法在合成数据集和现实生活数据集上的性能。仿真结果表明,在异步设置下,所提出的PAO-Fed能够实现与在线联邦随机梯度相同的收敛性能,并将通信开销减少了98%。