以基于不确定性的自动自动编码器为基础的隐私和实用保存数据类型自觉转换 (Uncertainty-Autoencoder-Based Privacy and Utility Preserving Data Type Conscious Transformation)

We propose an adversarial learning framework that deals with the privacy-utility tradeoff problem under two types of conditions: data-type ignorant, and data-type aware. Under data-type aware conditions, the privacy mechanism provides a one-hot encoding of categorical features, representing exactly one class, while under data-type ignorant conditions the categorical variables are represented by a collection of scores, one for each class. We use a neural network architecture consisting of a generator and a discriminator, where the generator consists of an encoder-decoder pair, and the discriminator consists of an adversary and a utility provider. Unlike previous research considering this kind of architecture, which leverages autoencoders (AEs) without introducing any randomness, or variational autoencoders (VAEs) based on learning latent representations which are then forced into a Gaussian assumption, our proposed technique introduces randomness and removes the Gaussian assumption restriction on the latent variables, only focusing on the end-to-end stochastic mapping of the input to privatized data. We test our framework on different datasets: MNIST, FashionMNIST, UCI Adult, and US Census Demographic Data, providing a wide range of possible private and utility attributes. We use multiple adversaries simultaneously to test our privacy mechanism -- some trained from the ground truth data and some trained from the perturbed data generated by our privacy mechanism. Through comparative analysis, our results demonstrate better privacy and utility guarantees than the existing works under similar, data-type ignorant conditions, even when the latter are considered under their original restrictive single-adversary model.

翻译：我们提出一个对抗性学习框架,在两类条件下处理隐私权-效用权衡问题:数据类型无知和数据类型认知。在数据类型认知条件下,隐私机制提供绝对特征的一热编码,完全代表一个类别,而在数据类型无知条件下,绝对变量代表一个分数的集合,每个类别一个。我们使用由生成者和歧视者组成的神经网络架构,其中生成者包括一个编码交换器配对,歧视者包括一个对手和一个公用事业提供者。与以往研究这种结构不同,这种结构利用了自译自算器(AEs),而没有引入任何随机性或变式自动编码器(VAEs),而在数据类型不明的条件下学习了潜在表达,然后被强制纳入高尔斯假设。我们提出的真理技术引入了随机性,并消除了高斯对潜在变量的假设限制,其中发电机由一个编码交换器组成,而制成的导师则由对手和一家公用事业供应商组成。我们测试了不同数据设置的框架:MNIST、FASIM-Q-Q-AL 数据在经过培训的私人隐私分析机制下提供更好的数据,我们经过培训的、经过内部测试的、经过系统测试的、经过系统测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过我们国空基数级数据测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过测试的、经过我们的数据的、经过测试的、经过我们的一些数据机制的、经过测试的、经过测试的、经过我们的数据的、经过我们的数据的、经过我们的数据的、经过我们的数据的、经过测试的、经过的、经过测试的、经过测试的数据范围的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的、经过的

相关内容

自编码器

关注 140

自动编码器是一种人工神经网络，用于以无监督的方式学习有效的数据编码。自动编码器的目的是通过训练网络忽略信号“噪声”来学习一组数据的表示（编码），通常用于降维。与简化方面一起，学习了重构方面，在此，自动编码器尝试从简化编码中生成尽可能接近其原始输入的表示形式，从而得到其名称。基本模型存在几种变体，其目的是迫使学习的输入表示形式具有有用的属性。自动编码器可有效地解决许多应用问题，从面部识别到获取单词的语义。

【开放书】卡耐基梅隆大学Elaine Shi 教授《Foundations of Distributed Consensus and Blockchains（分布式共识和区块链的基础）》150页pdf

专知会员服务

30+阅读 · 2022年2月22日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

165+阅读 · 2020年3月18日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日