As end-user device capability increases and demand for intelligent services at the Internet's edge rise, distributed learning has emerged as a key enabling technology. Existing approaches like federated learning (FL) and decentralized FL (DFL) enable distributed learning among clients, while gossip learning (GL) approaches have emerged to address the potential challenges in resource-constrained, connectivity-challenged infrastructure-less environments. However, most distributed learning approaches assume largely homogeneous data distributions and may not consider or exploit the heterogeneity of clients and their underlying data distributions. This paper introduces Chisme, a novel fully decentralized distributed learning algorithm designed to address the challenges of implementing robust intelligence in network edge contexts characterized by heterogeneous data distributions, episodic connectivity, and sparse network infrastructure. Chisme leverages cosine similarity-based data affinity heuristics calculated from received model exchanges to inform how much influence received models have when merging into the local model. By doing so, it facilitates stronger merging influence between clients with more similar model learning progressions, enabling clients to strategically balance between broader collaboration to build more general knowledge and more selective collaboration to build specific knowledge. We evaluate Chisme against contemporary approaches using image recognition and time-series prediction scenarios while considering different network connectivity conditions, representative of real-world distributed intelligent systems. Our experiments demonstrate that Chisme outperforms state-of-the-art edge intelligence approaches in almost every case -- clients using Chisme exhibit faster training convergence, lower final loss after training, and lower performance disparity between clients.
翻译:随着终端用户设备能力提升及互联网边缘智能服务需求增长,分布式学习已成为关键使能技术。现有方法如联邦学习(FL)与去中心化联邦学习(DFL)支持客户端间的分布式学习,而流言学习(GL)方法则针对资源受限、连接困难、无基础设施环境中的潜在挑战应运而生。然而,大多数分布式学习方法假设数据分布基本同质,未能充分考虑或利用客户端及其底层数据分布的异质性。本文提出Chisme——一种新颖的完全去中心化分布式学习算法,旨在解决在具有异质数据分布、间歇性连接和稀疏网络基础设施特征的网络边缘场景中实现鲁棒智能所面临的挑战。Chisme基于从接收的模型交换中计算出的余弦相似度数据亲和性启发式规则,以此决定接收模型在融入本地模型时的影响力权重。通过这种方式,该算法能在学习进程更相似的客户端之间建立更强的模型融合影响力,使客户端能够策略性地平衡广泛协作以构建通用知识,与选择性协作以构建特定知识之间的关系。我们在图像识别和时间序列预测场景下,结合代表现实世界分布式智能系统的不同网络连接条件,将Chisme与现有方法进行比较评估。实验结果表明,Chisme在几乎所有案例中都优于最先进的边缘智能方法——使用Chisme的客户端展现出更快的训练收敛速度、更低的训练最终损失以及更小的客户端间性能差异。