Over the recent years, with the increasing adoption of Federated Learning (FL) algorithms and growing concerns over personal data privacy, Privacy-Preserving Federated Learning (PPFL) has attracted tremendous attention from both academia and industry. Practical PPFL typically allows multiple participants to individually train their machine learning models, which are then aggregated to construct a global model in a privacy-preserving manner. As such, Privacy-Preserving Aggregation (PPAgg) as the key protocol in PPFL has received substantial research interest. This survey aims to fill the gap between a large number of studies on PPFL, where PPAgg is adopted to provide a privacy guarantee, and the lack of a comprehensive survey on the PPAgg protocols applied in FL systems. In this survey, we review the PPAgg protocols proposed to address privacy and security issues in FL systems. The focus is placed on the construction of PPAgg protocols with an extensive analysis of the advantages and disadvantages of these selected PPAgg protocols and solutions. Additionally, we discuss the open-source FL frameworks that support PPAgg. Finally, we highlight important challenges and future research directions for applying PPAgg to FL systems and the combination of PPAgg with other technologies for further security improvement.
翻译:近年来,随着联邦学习联合会(FL)算法的采用以及对个人数据隐私的日益关切,隐私保护联邦学习联合会(PPFL)吸引了学术界和工业界的极大关注,实用PPFL通常允许多个参与者单独培训机器学习模式,然后将这些模式汇总,以便以保护隐私的方式构建一个全球模式,因此,作为PPPL关键议定书的隐私保护聚合(PPAgg)得到了大量的研究兴趣,这项调查旨在填补关于PPPFL的大量研究之间的差距,即采用PPPPAgg来提供隐私保障,而缺乏对FL系统应用的PPPAgg协议的全面调查。在这次调查中,我们审查了PPPAgg协议, 目的是为了解决FL系统中的隐私和安全问题,重点是建立PPAgg协议,广泛分析这些选定的PPPAgg协议和解决方案的利弊。此外,我们讨论了支持PPPPAgg的开放源框架。最后,我们强调了将PPPA技术与FL系统的其他安全改进组合的重大挑战和未来研究方向。