Machine learning (ML) is increasingly being adopted in a wide variety of application domains. Usually, a well-performing ML model, especially, emerging deep neural network model, relies on a large volume of training data and high-powered computational resources. The need for a vast volume of available data raises serious privacy concerns because of the risk of leakage of highly privacy-sensitive information and the evolving regulatory environments that increasingly restrict access to and use of privacy-sensitive data. Furthermore, a trained ML model may also be vulnerable to adversarial attacks such as membership/property inference attacks and model inversion attacks. Hence, well-designed privacy-preserving ML (PPML) solutions are crucial and have attracted increasing research interest from academia and industry. More and more efforts of PPML are proposed via integrating privacy-preserving techniques into ML algorithms, fusing privacy-preserving approaches into ML pipeline, or designing various privacy-preserving architectures for existing ML systems. In particular, existing PPML arts cross-cut ML, system, security, and privacy; hence, there is a critical need to understand state-of-art studies, related challenges, and a roadmap for future research. This paper systematically reviews and summarizes existing privacy-preserving approaches and proposes a PGU model to guide evaluation for various PPML solutions through elaborately decomposing their privacy-preserving functionalities. The PGU model is designed as the triad of Phase, Guarantee, and technical Utility. Furthermore, we also discuss the unique characteristics and challenges of PPML and outline possible directions of future work that benefit a wide range of research communities among ML, distributed systems, security, and privacy areas.
翻译:通常,良好的ML模式,特别是正在形成的深神经网络模式,依赖大量的培训数据和高功率计算资源;大量可用数据的需求引起严重的隐私关切,因为高度隐私敏感信息有可能泄漏,监管环境不断变化,越来越限制获取和使用隐私敏感数据;此外,经过培训的ML模式也可能易受敌对攻击,如会员/财产推断攻击和反向攻击模式。因此,设计良好的隐私保护ML(PPML)解决方案至关重要,并吸引学术界和工业界越来越多的研究兴趣。 提议将隐私保护技术纳入ML算法,将隐私保护方法应用于ML管道,或为现有ML系统设计各种隐私保护架构。 特别是,现有的PPML艺术交叉打击袭击和反向攻击模式、系统、安保和隐私模式。 因此,非常需要理解隐私保护ML(PML)解决方案的妥善设计,系统化的保密成本分析、相关流程流程流程流程,通过系统化的流程流程化的流程化流程化的流程化流程化流程化流程化的流程化流程化流程化、流程化的流程化流程化流程化流程化的流程化流程化流程化流程化流程、流程化流程化流程、流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程流程流程流程流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程流程流程化流程化流程流程流程流程流程、流程化流程化流程化流程、流程、流程化流程化流程化流程、流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程流程化流程化流程流程化流程化流程化流程化流程流程流程化流程化流程化流程化流程流程流程流程流程流程流程流程流程流程流程流程流程化流程流程流程流程流程流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程化流程流程流程流程流程流程化流程化流程化流程化流程化流程化流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程化流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程流程