DPD-fVAE: 使用不同私营解码器的联邦变式自动自动编码器制作合成数据 (DPD-fVAE: Synthetic Data Generation Using Federated Variational Autoencoders With Differentially-Private Decoder)

Federated learning (FL) is getting increased attention for processing sensitive, distributed datasets common to domains such as healthcare. Instead of directly training classification models on these datasets, recent works have considered training data generators capable of synthesising a new dataset which is not protected by any privacy restrictions. Thus, the synthetic data can be made available to anyone, which enables further evaluation of machine learning architectures and research questions off-site. As an additional layer of privacy-preservation, differential privacy can be introduced into the training process. We propose DPD-fVAE, a federated Variational Autoencoder with Differentially-Private Decoder, to synthesise a new, labelled dataset for subsequent machine learning tasks. By synchronising only the decoder component with FL, we can reduce the privacy cost per epoch and thus enable better data generators. In our evaluation on MNIST, Fashion-MNIST and CelebA, we show the benefits of DPD-fVAE and report competitive performance to related work in terms of Fr\'echet Inception Distance and accuracy of classifiers trained on the synthesised dataset.

翻译：联邦学习组织(FL)日益重视处理保健等领域常见的敏感、分布式数据集。最近的工作不是直接培训关于这些数据集的分类模型,而是考虑培训能够合成没有隐私限制保护的新数据集的数据生成器。因此,可以向任何人提供合成数据,从而能够进一步评估机器学习架构和场外研究问题。作为保护隐私的另外一层,可以在培训过程中引入不同的隐私。我们提议DPD-fVAE,即一个与差异-私人解密公司联合的动态自动计算机,为随后的机器学习任务合成一个新的、贴标签的数据集。通过只与FL同步解密组件,我们可以降低每粒子的隐私成本,从而使得更好的数据生成器得以使用。在对MNIST、Fashaon-MNIST和CelebA的评估中,我们展示DD-fVAE的好处,并报告在Fr\hechet Incepion远程和在合成数据集培训的分类员的准确性工作方面的竞争性表现。

相关内容

自编码器

关注 140

自动编码器是一种人工神经网络，用于以无监督的方式学习有效的数据编码。自动编码器的目的是通过训练网络忽略信号“噪声”来学习一组数据的表示（编码），通常用于降维。与简化方面一起，学习了重构方面，在此，自动编码器尝试从简化编码中生成尽可能接近其原始输入的表示形式，从而得到其名称。基本模型存在几种变体，其目的是迫使学习的输入表示形式具有有用的属性。自动编码器可有效地解决许多应用问题，从面部识别到获取单词的语义。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

专知会员服务

39+阅读 · 2020年11月3日

因果图，Causal Graphs，52页ppt