Vertical Federated Learning (VFL) is a federated learning setting where multiple parties with different features about the same set of users jointly train machine learning models without exposing their raw data or model parameters. Motivated by the rapid growth in VFL research and real-world applications, we provide a comprehensive review of the concept and algorithms of VFL, as well as current advances and challenges in various aspects, including effectiveness, efficiency, and privacy. We provide an exhaustive categorization for VFL settings and privacy-preserving protocols and comprehensively analyze the privacy attacks and defense strategies for each protocol. In the end, we propose a unified framework, termed VFLow, which considers the VFL problem under communication, computation, privacy, and effectiveness constraints. Finally, we review the most recent advances in industrial applications, highlighting open challenges and future directions for VFL.
翻译:纵向联邦学习(VFL)是一个联盟式学习环境,许多对同一组用户具有不同特点的多方在不披露原始数据或模型参数的情况下联合培训机器学习模型,受VFL研究和现实世界应用的迅速增长的推动,我们全面审查了VFL的概念和算法,以及当前在包括有效性、效率和隐私在内的各个方面的进展和挑战。我们详尽地分类了VFL的设置和隐私保护协议,并全面分析了每个协议的隐私攻击和防御战略。最后,我们提出了一个统一框架,称为VFLow,其中审议了通信、计算、隐私和有效性限制下的VFL问题。最后,我们审查了工业应用的最新进展,突出了VFL的公开挑战和未来方向。