While machine learning (ML) has made tremendous progress during the past decade, recent research has shown that ML models are vulnerable to various security and privacy attacks. So far, most of the attacks in this field focus on discriminative models, represented by classifiers. Meanwhile, little attention has been paid to the security and privacy risks of generative models, such as generative adversarial networks (GANs). In this paper, we propose the first set of training dataset property inference attacks against GANs. Concretely, the adversary aims to infer the macro-level training dataset property, i.e., the proportion of samples used to train a target GAN with respect to a certain attribute. A successful property inference attack can allow the adversary to gain extra knowledge of the target GAN's training dataset, thereby directly violating the intellectual property of the target model owner. Also, it can be used as a fairness auditor to check whether the target GAN is trained with a biased dataset. Besides, property inference can serve as a building block for other advanced attacks, such as membership inference. We propose a general attack pipeline that can be tailored to two attack scenarios, including the full black-box setting and partial black-box setting. For the latter, we introduce a novel optimization framework to increase the attack efficacy. Extensive experiments over four representative GAN models on five property inference tasks show that our attacks achieve strong performance. In addition, we show that our attacks can be used to enhance the performance of membership inference against GANs.
翻译:虽然机器学习(ML)在过去十年中取得了巨大进展,但最近的研究表明,ML模型很容易受到各种安全和隐私攻击的伤害。到目前为止,该领域的大多数攻击都集中在以分类者为代表的歧视性模型上。与此同时,很少注意基因变异模型的安全和隐私风险,例如基因对抗网络(GANs) 。在本文中,我们提议第一套培训数据集属性对GANs的攻击。具体地说,对手的目的是推断宏观培训数据集属性,即用于培训目标GAN的样本比例与某个属性有关。成功的财产推断攻击能够让对手更多地了解GAN的培训数据集,从而直接侵犯目标模型所有者的知识产权。此外,我们还可以用它作为公平审计员来检查目标GAN是否受到有偏差的数据集的培训。此外,财产推断可以用来作为其他高级攻击的建筑块,例如用来对某个属性进行训练的样品比例。我们建议对GAN公司的攻击目标进行大规模攻击的实验,可以提高GAN公司对目标的性能框架的准确性能,从而显示我们用来对攻击的四种攻击的精确性攻击情景的精确性。我们用在黑推断中,在黑推断中提出一个普通攻击性攻击试验中可以提高整个攻击试验。我们用来显示攻击性攻击的性攻击的性能框架,在后试验中可以显示整个攻击的性能框架。我们用来显示的四种攻击的性能攻击的性能。