成员对机器学习的推断攻击:一项调查 (Membership Inference Attacks on Machine Learning: A Survey)

Membership inference attack aims to identify whether a data sample was used to train a machine learning model or not. It can raise severe privacy risks as the membership can reveal an individual's sensitive information. For example, identifying an individual's participation in a hospital's health analytics training set reveals that this individual was once a patient in that hospital. Membership inference attacks have been shown to be effective on various machine learning models, such as classification models, generative models, and sequence-to-sequence models. Meanwhile, many methods are proposed to defend such a privacy attack. Although membership inference attack is an emerging and rapidly growing research area, there is no comprehensive survey on this topic yet. In this paper, we bridge this important gap in membership inference attack literature. We present the first comprehensive survey of membership inference attacks. We summarize and categorize existing membership inference attacks and defenses and explicitly present how to implement attacks in various settings. Besides, we discuss why membership inference attacks work and summarize the benchmark datasets to facilitate comparison and ensure fairness of future work. Finally, we propose several possible directions for future research and possible applications relying on reviewed works.

翻译：身份推断攻击旨在确定数据样本是否被用于培训机器学习模型; 它可以带来严重的隐私风险,因为成员可以披露个人敏感信息; 例如,确定个人参与医院健康分析培训的一组情况表明,此人曾经是该医院的病人; 成员推断攻击已证明对各种机器学习模式,例如分类模型、基因模型和顺序顺序模型有效; 同时,提出了许多方法来保护这种隐私攻击。虽然成员推断攻击是一个新兴和迅速增长的研究领域,但尚未对这一主题进行全面调查。在本文中,我们弥合了在成员推断攻击文献方面的这一重要差距。我们首次对成员推断攻击进行了全面调查。我们总结和分类了现有的成员推断攻击和防御,并明确介绍了在不同环境中实施攻击的方法。此外,我们讨论了成员资格推断攻击工作的原因,并总结了基准数据集,以便利比较并确保未来工作的公正性。最后,我们提出了未来研究的若干可能的方向,并可能依靠已审查的工程的应用。