不可逆表单式GANs:用单Stone杀死两只鸟,用于表单数据合成 (Invertible Tabular GANs: Killing Two Birds with OneStone for Tabular Data Synthesis) - 专知论文

会员服务 ·

0

求逆 · GANs · 可约的 · INFORMS · 正则化项 ·

2022 年 2 月 8 日

Invertible Tabular GANs: Killing Two Birds with OneStone for Tabular Data Synthesis

翻译：不可逆表单式GANs:用单Stone杀死两只鸟,用于表单数据合成

Jaehoon Lee,Jihyeon Hyeong,Jinsung Jeon,Noseong Park,Jihoon Cho

from arxiv, 19 pages

Tabular data synthesis has received wide attention in the literature. This is because available data is often limited, incomplete, or cannot be obtained easily, and data privacy is becoming increasingly important. In this work, we present a generalized GAN framework for tabular synthesis, which combines the adversarial training of GANs and the negative log-density regularization of invertible neural networks. The proposed framework can be used for two distinctive objectives. First, we can further improve the synthesis quality, by decreasing the negative log-density of real records in the process of adversarial training. On the other hand, by increasing the negative log-density of real records, realistic fake records can be synthesized in a way that they are not too much close to real records and reduce the chance of potential information leakage. We conduct experiments with real-world datasets for classification, regression, and privacy attacks. In general, the proposed method demonstrates the best synthesis quality (in terms of task-oriented evaluation metrics, e.g., F1) when decreasing the negative log-density during the adversarial training. If increasing the negative log-density, our experimental results show that the distance between real and fake records increases, enhancing robustness against privacy attacks.

翻译：文献中广泛关注了表层数据合成,因为现有数据往往有限、不完整或难以轻易获得,数据隐私越来越重要。在这项工作中,我们提出了一个通用的表格合成GAN框架,将全球网络的对抗性培训和不可视神经网络的负日密度正规化结合起来。拟议框架可用于两个不同的目标。首先,我们可以进一步改进综合质量,降低对抗性培训过程中真实记录的负日密度。另一方面,通过提高真实记录的负日密度,可以合成现实的假记录,使其不远接近真实记录,减少潜在信息泄漏的可能性。我们用真实世界数据集进行分类、回归和隐私攻击的实验。总体而言,拟议方法在减少对抗性培训中的负日密度时,可以进一步提高合成质量(任务导向评价指标,例如F1),从而降低负面日志密度。如果提高负面日志密度,则可以合成的假记录可以合成为真实的距离增加。

0

相关内容

【CVPR 2022】盲图像超分辨率退化分布的研究，Learning the Degradation Distribution for Blind Image Super-Resolution

【CVPR 2022】盲图像超分辨率退化分布的研究，Learning the Degradation Distribution for Blind Image Super-Resolution

专知会员服务

7+阅读 · 2022年3月12日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

面向遮挡条件下的人脸识别方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

自旋轨道耦合莫特绝缘体的量子磁性调控

国家自然科学基金

0+阅读 · 2014年12月31日

华支睾吸虫病致肝纤维化分泌排泄抗原新靶点果糖1,6二磷酸的CCK/Leptin途径的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

腺病毒介导的miRNA干扰策略抗MERS-CoV的研究

国家自然科学基金

0+阅读 · 2013年12月31日

腺病毒介导精氨酸脱亚氨基酶靶向性基因治疗肝癌的机制

国家自然科学基金

1+阅读 · 2012年12月31日

LIGHT增强IR加Veliparib诱导的衰老肿瘤细胞疫苗的抗肿瘤作用

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

布尔函数的密码性质研究

国家自然科学基金

0+阅读 · 2011年12月31日

有限生成系的平坦覆盖与同调代数

国家自然科学基金

0+阅读 · 2009年12月31日

Learned Monocular Depth Priors in Visual-Inertial Initialization

Arxiv

0+阅读 · 2022年4月20日

Invertible Mask Network for Face Privacy-Preserving

Invertible Mask Network for Face Privacy-Preserving

Arxiv

0+阅读 · 2022年4月19日

BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis

Arxiv

0+阅读 · 2022年4月19日

Jacobian Ensembles Improve Robustness Trade-offs to Adversarial Attacks

Arxiv

0+阅读 · 2022年4月19日

Imbalanced Classification via a Tabular Translation GAN

Arxiv

0+阅读 · 2022年4月19日

UNBUS: Uncertainty-aware Deep Botnet Detection System in Presence of Perturbed Samples

Arxiv

1+阅读 · 2022年4月18日

Magnifying Networks for Images with Billions of Pixels

Arxiv

0+阅读 · 2022年4月18日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

DSGAN: Generative Adversarial Training for Distant Supervision Relation Extraction

Arxiv

15+阅读 · 2018年5月24日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】盲图像超分辨率退化分布的研究，Learning the Degradation Distribution for Blind Image Super-Resolution

【CVPR 2022】盲图像超分辨率退化分布的研究，Learning the Degradation Distribution for Blind Image Super-Resolution

专知会员服务

7+阅读 · 2022年3月12日

【Google】深度学习对抗鲁棒性，43页ppt

专知会员服务

45+阅读 · 2020年10月31日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

CVPR 2020 论文开源项目合集

专知会员服务

110+阅读 · 2020年3月12日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

数据智能体综述：新兴范式还是被高估的炒作？

海底战已至：美国构思海底安全战略 | 最新报告

【ICCV2025教程】视觉异常检测中的基础模型：进展、挑战与应用

美军将无人自主等新技术融入潜艇部队以更具杀伤力

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

相关论文

Learned Monocular Depth Priors in Visual-Inertial Initialization

Arxiv

0+阅读 · 2022年4月20日

Invertible Mask Network for Face Privacy-Preserving

Invertible Mask Network for Face Privacy-Preserving

Arxiv

0+阅读 · 2022年4月19日

BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis

Arxiv

0+阅读 · 2022年4月19日

Jacobian Ensembles Improve Robustness Trade-offs to Adversarial Attacks

Arxiv

0+阅读 · 2022年4月19日

Imbalanced Classification via a Tabular Translation GAN

Arxiv

0+阅读 · 2022年4月19日

UNBUS: Uncertainty-aware Deep Botnet Detection System in Presence of Perturbed Samples

Arxiv

1+阅读 · 2022年4月18日

Magnifying Networks for Images with Billions of Pixels

Arxiv

0+阅读 · 2022年4月18日

Adversarial and Contrastive Variational Autoencoder for Sequential Recommendation

Arxiv

17+阅读 · 2021年3月19日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

DSGAN: Generative Adversarial Training for Distant Supervision Relation Extraction

Arxiv

15+阅读 · 2018年5月24日

相关基金

面向遮挡条件下的人脸识别方法研究

国家自然科学基金

2+阅读 · 2015年12月31日

自旋轨道耦合莫特绝缘体的量子磁性调控

国家自然科学基金

0+阅读 · 2014年12月31日

华支睾吸虫病致肝纤维化分泌排泄抗原新靶点果糖1,6二磷酸的CCK/Leptin途径的研究

国家自然科学基金

0+阅读 · 2013年12月31日

基于Exemplar-Classifier思想的高分辨率光学遥感影像目标识别研究

国家自然科学基金

2+阅读 · 2013年12月31日

腺病毒介导的miRNA干扰策略抗MERS-CoV的研究

国家自然科学基金

0+阅读 · 2013年12月31日

腺病毒介导精氨酸脱亚氨基酶靶向性基因治疗肝癌的机制

国家自然科学基金

1+阅读 · 2012年12月31日

LIGHT增强IR加Veliparib诱导的衰老肿瘤细胞疫苗的抗肿瘤作用

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

布尔函数的密码性质研究

国家自然科学基金

0+阅读 · 2011年12月31日

有限生成系的平坦覆盖与同调代数

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员