Deep learning has proven to be successful in various domains and for different tasks. However, when it comes to private data several restrictions are making it difficult to use deep learning approaches in these application fields. Recent approaches try to generate data privately instead of applying a privacy-preserving mechanism directly, on top of the classifier. The solution is to create public data from private data in a manner that preserves the privacy of the data. In this work, two very prominent GAN-based architectures were evaluated in the context of private time series classification. In contrast to previous work, mostly limited to the image domain, the scope of this benchmark was the time series domain. The experiments show that especially GSWGAN performs well across a variety of public datasets outperforming the competitor DPWGAN. An analysis of the generated datasets further validates the superiority of GSWGAN in the context of time series generation.
翻译:译文:从私有到公共:在私有时间序列分类的背景下对GAN进行基准测试
摘要:深度学习已经在不同领域和任务中取得了成功。然而,在涉及私有数据时,有几个限制使得在这些应用领域中使用深度学习方法变得困难。最近的方法尝试在分类器顶部生成数据,而不是直接应用隐私保护机制。解决方案是以保护数据的方式创建公共数据来自私有数据。在本研究中,评估了两种非常著名的GAN-based架构,在私有时间序列分类的背景下进行了评估。不同于以往主要局限于图像领域的研究,本基准测试的范围是时间序列领域。实验表明,尤其是GSWGAN在各种公共数据集上表现良好,优于竞争对手DPWGAN。生成的数据集的分析进一步验证了GSWGAN在时间序列生成领域的优越性。