We propose an adversarial learning framework that deals with the privacy-utility tradeoff problem under two types of conditions: data-type ignorant, and data-type aware. Under data-type aware conditions, the privacy mechanism provides a one-hot encoding of categorical features, representing exactly one class, while under data-type ignorant conditions the categorical variables are represented by a collection of scores, one for each class. We use a neural network architecture consisting of a generator and a discriminator, where the generator consists of an encoder-decoder pair, and the discriminator consists of an adversary and a utility provider. Unlike previous research considering this kind of architecture, which leverages autoencoders (AEs) without introducing any randomness, or variational autoencoders (VAEs) based on learning latent representations which are then forced into a Gaussian assumption, our proposed technique introduces randomness and removes the Gaussian assumption restriction on the latent variables, only focusing on the end-to-end stochastic mapping of the input to privatized data. We test our framework on different datasets: MNIST, FashionMNIST, UCI Adult, and US Census Demographic Data, providing a wide range of possible private and utility attributes. We use multiple adversaries simultaneously to test our privacy mechanism -- some trained from the ground truth data and some trained from the perturbed data generated by our privacy mechanism. Through comparative analysis, our results demonstrate better privacy and utility guarantees than the existing works under similar, data-type ignorant conditions, even when the latter are considered under their original restrictive single-adversary model.
翻译:我们提出一个对抗性学习框架,在两类条件下处理隐私权-效用权衡问题:数据类型无知和数据类型认知。在数据类型认知条件下,隐私机制提供绝对特征的一热编码,完全代表一个类别,而在数据类型无知条件下,绝对变量代表一个分数的集合,每个类别一个。我们使用由生成者和歧视者组成的神经网络架构,其中生成者包括一个编码交换器配对,歧视者包括一个对手和一个公用事业提供者。与以往研究这种结构不同,这种结构利用了自译自算器(AEs),而没有引入任何随机性或变式自动编码器(VAEs),而在数据类型不明的条件下学习了潜在表达,然后被强制纳入高尔斯假设。我们提出的真理技术引入了随机性,并消除了高斯对潜在变量的假设限制,其中发电机由一个编码交换器组成,而制成的导师则由对手和一家公用事业供应商组成。我们测试了不同数据设置的框架:MNIST、FASIM-Q-Q-AL 数据在经过培训的私人隐私分析机制下提供更好的数据,我们经过培训的、经过内部测试的、经过系统测试的、经过系统测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过我们国空基数级数据测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过我们的数据的、经过测试的、经过我们的一些数据机制的、经过测试的、经过测试的、经过我们的数据的、经过我们的数据的、经过我们的数据的、经过我们的数据的、经过测试的、经过的、经过测试的、经过测试的数据范围的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的