Federated learning (FL) and split learning (SL) are the two popular distributed machine learning (ML) approaches that provide some data privacy protection mechanisms. In the time-series classification problem, many researchers typically use 1D convolutional neural networks (1DCNNs) based on the SL approach with a single client to reduce the computational overhead at the client-side while still preserving data privacy. Another method, recurrent neural network (RNN), is utilized on sequentially partitioned data where segments of multiple-segment sequential data are distributed across various clients. However, to the best of our knowledge, it is still not much work done in SL with long short-term memory (LSTM) network, even the LSTM network is practically effective in processing time-series data. In this work, we propose a new approach, LSTMSPLIT, that uses SL architecture with an LSTM network to classify time-series data with multiple clients. The differential privacy (DP) is applied to solve the data privacy leakage. The proposed method, LSTMSPLIT, has achieved better or reasonable accuracy compared to the Split-1DCNN method using the electrocardiogram dataset and the human activity recognition dataset. Furthermore, the proposed method, LSTMSPLIT, can also achieve good accuracy after applying differential privacy to preserve the user privacy of the cut layer of the LSTMSPLIT.
翻译:联邦学习(FL)和分解学习(SL)是两种流行的分布式机器学习(ML)方法,提供一些数据隐私保护机制。在时间序列分类问题中,许多研究人员通常使用基于单一客户的SL方法的1D进化神经网络(1DCNNS),以减少客户端的计算间接费用,同时仍然保护数据隐私。另一种方法,即经常性神经网络(RNNN),用于按顺序分割的数据,即多层相系相继数据部分在各客户之间分布。然而,据我们所知,在长期短期内存储(LSTM)网络的SL工作仍然不多,甚至LSTM网络在处理时间序列数据方面实际上有效。在这项工作中,我们提议采用一种新的方法,即LSTMSPLIT结构,利用LSTM网络对多个客户的时间序列数据进行分类。使用差异隐私(DP)解决数据隐私渗漏问题。拟议的方法,即LSTMSPLIT,与Sl-1-DCNNNS方法相比,与Splet-S-ST-SL的准确性数据层比较,在使用拟议的L-SRIT数据系统后,还可以识别数据识别数据系统,还可以识别数据识别数据识别识别。