Everyday, large amounts of sensitive data \sai{is} distributed across mobile phones, wearable devices, and other sensors. Traditionally, these enormous datasets have been processed on a single system, with complex models being trained to make valuable predictions. Distributed machine learning techniques such as Federated and Split Learning have recently been developed to protect user \sai{data and} privacy better while ensuring high performance. Both of these distributed learning architectures have advantages and disadvantages. In this paper, we examine these tradeoffs and suggest a new hybrid Federated Split Learning architecture that combines the efficiency and privacy benefits of both. Our evaluation demonstrates how our hybrid Federated Split Learning approach can lower the amount of processing power required by each client running a distributed learning system, reduce training and inference time while keeping a similar accuracy. We also discuss the resiliency of our approach to deep learning privacy inference attacks and compare our solution to other recently proposed benchmarks.
翻译:每天,在移动电话、可磨损设备和其他传感器之间分发大量敏感数据。传统上,这些庞大的数据集是在单一系统中处理的,其复杂的模型正在得到培训,以作出有价值的预测。最近开发了联邦和分解学习等分散的机器学习技术,以更好地保护用户的隐私,同时确保高性能。这两种分散的学习结构都有优点和缺点。在本文中,我们研究了这些权衡,并提出了一个新的混合的联邦分解学习结构,将两者的效率和隐私效益结合起来。我们的评估表明,我们混合的联邦分解学习方法可以降低每个客户使用分布式学习系统所需的处理能力,减少培训和推算时间,同时保持类似的准确性。我们还讨论了我们深层学习隐私攻击的方法的弹性,并将我们的解决办法与其他最近提出的基准进行比较。