With the rapid demand of data and computational resources in deep learning systems, a growing number of algorithms to utilize collaborative machine learning techniques, for example, federated learning, to train a shared deep model across multiple participants. It could effectively take advantage of the resources of each participant and obtain a more powerful learning system. However, integrity and privacy threats in such systems have greatly obstructed the applications of collaborative learning. And a large amount of works have been proposed to maintain the model integrity and mitigate the privacy leakage of training data during the training phase for different collaborative learning systems. Compared with existing surveys that mainly focus on one specific collaborative learning system, this survey aims to provide a systematic and comprehensive review of security and privacy researches in collaborative learning. Our survey first provides the system overview of collaborative learning, followed by a brief introduction of integrity and privacy threats. In an organized way, we then detail the existing integrity and privacy attacks as well as their defenses. We also list some open problems in this area and opensource the related papers on GitHub: https://github.com/csl-cqu/awesome-secure-collebrative-learning-papers.
翻译:由于深层学习系统对数据和计算资源的需求迅速,越来越多的算法利用合作机器学习技术,例如联合学习,对多个参与者进行共同的深层次模型培训,可以有效地利用每个参与者的资源,并获得更强大的学习系统;然而,由于这些系统中的完整和隐私威胁极大地阻碍了合作学习的应用;还提议进行大量工作,以保持模型的完整性,减少不同合作学习系统培训阶段培训数据隐私的泄漏;与主要侧重于一个具体合作学习系统的现有调查相比,这项调查旨在对合作学习中的安全和隐私研究进行系统、全面的审查;我们的调查首先提供了合作学习系统概览,随后简要介绍了完整性和隐私威胁;然后,我们有组织地详细介绍了现有的完整和隐私攻击及其防御系统;我们还列举了这方面的一些公开问题,并公开来源了有关GitHub的文件:https://github.com/csl-cqu/awecome-secure-coebristrate-leclear-papers。