Federated Learning (FL), Split Learning (SL), and SplitFed Learning (SFL) are three recent developments in distributed machine learning that are gaining attention due to their ability to preserve the privacy of raw data. Thus, they are widely applicable in domains where data is sensitive, such as large-scale medical image classification, the internet of medical things, and cross-organization phishing email detection. SFL sits at the confluence of FL and SL: it combines the best of both by providing parallel client-side model updates, inherited from the FL paradigm, and a higher level of model privacy during training, obtained by splitting the model between the clients and the server as in SL. However, SFL incurs communication and computation overhead on the client side due to the requirement of client-side model synchronization. For resource-constrained clients, removing this requirement is needed to make learning efficient. In this regard, this paper studies SFL without client-side model synchronization. The resulting architecture is known as Multi-head Split Learning. Our empirical studies, using the ResNet18 model on MNIST data under an IID data distribution among distributed clients, find that Multi-head Split Learning is feasible and that its performance is comparable to SFL. Moreover, SFL provides only 1%-2% better accuracy than Multi-head Split Learning on the MNIST test set. To further strengthen our results, we study Multi-head Split Learning with various sizes of the client-side model portion and its impact on overall performance. Our results show that the choice of client-side portion has a minimal impact on the overall performance of the model.
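To make the model-splitting idea concrete, the following is a minimal sketch, not the authors' implementation, of how a ResNet18 could be partitioned into a client-side and a server-side portion at an assumed cut layer; the cut point and tensor shapes are illustrative assumptions only.

```python
# Minimal sketch (assumed, not the paper's code): splitting ResNet18 into a
# client-side portion and a server-side portion at a hypothetical cut layer.
import torch
import torch.nn as nn
from torchvision.models import resnet18

full_model = resnet18(num_classes=10)            # 10 classes, as for MNIST
layers = list(full_model.children())             # conv1, bn1, relu, maxpool, layer1..4, avgpool, fc

cut = 4                                          # hypothetical cut point after the stem
client_model = nn.Sequential(*layers[:cut])      # runs on each client
server_model = nn.Sequential(*layers[cut:-1],    # runs on the server
                             nn.Flatten(),
                             layers[-1])

# Dummy 3-channel batch; MNIST images would need channel replication/resizing.
x = torch.randn(8, 3, 224, 224)
smashed = client_model(x)                        # "smashed data" sent to the server
logits = server_model(smashed)                   # server completes the forward pass
print(logits.shape)                              # torch.Size([8, 10])
```

In SFL, each client holds a copy of `client_model` and these copies are periodically synchronized; the Multi-head Split Learning variant studied here would simply omit that client-side synchronization step.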