区别式私人合成数据 (Algorithmically Effective Differentially Private Synthetic Data) - 专知论文

会员服务 ·

0

分解的 · 模型评估 · 数据集 · FAST · 优化器 ·

2023 年 2 月 11 日

Algorithmically Effective Differentially Private Synthetic Data

翻译：区别式私人合成数据

Yiyun He,Roman Vershynin,Yizhe Zhu

from arxiv, 23 pages

We present a highly effective algorithmic approach for generating $\varepsilon$-differentially private synthetic data in a bounded metric space with near-optimal utility guarantees under the 1-Wasserstein distance. In particular, for a dataset $\mathcal X$ in the hypercube $[0,1]^d$, our algorithm generates synthetic dataset $\mathcal Y$ such that the expected 1-Wasserstein distance between the empirical measure of $\mathcal X$ and $\mathcal Y$ is $O((\varepsilon n)^{-1/d})$ for $d\geq 2$, and is $O(\log^2(\varepsilon n)(\varepsilon n)^{-1})$ for $d=1$. The accuracy guarantee is optimal up to a constant factor for $d\geq 2$, and up to a logarithmic factor for $d=1$. Our algorithm has a fast running time of $O(\varepsilon n)$ for all $d\geq 1$ and demonstrates improved accuracy compared to the method in (Boedihardjo et al., 2022) for $d\geq 2$.

翻译：我们提出了一种非常有效的算法方法,用于在1瓦瑟斯坦距离下,以1瓦瑟斯坦距离以近最佳的效用保证在封闭的公制空间中生成美元和瓦瑟斯隆的私人合成数据。特别是,对于超立方[$0,1美元]的数据集,我们的算法产生合成数据集$gmathcal Y$,因此,预期Wasserstein在1瓦瑟斯坦以1美元和1美元的经验计量标准之间的距离为$gmassal X$和$gmathcal Y$。我们的算法以2美元为单位,以2美元为单位,以美元为单位,以2美元为单位,以1美元为单位,以美元为单位,以1美元为单位,以美元为单位,以1瓦瑟斯斯坦为单位,以1美元为单位,以1美元为单位,以1美元为单位,以1美元为单位,以美元为单位计算,以美元为美元计算,以美元为美元/美元为美元计算,以美元为美元计算速度运行时间为1美元,以美元为美元,以美元为美元为美元,以美元为美元为美元为美元,以美元为美元为美元,以美元为美元为美元,以美元为美元为美元为美元,以美元为美元为美元,以美元为美元为美元,以美元为美元为美元,以美元为美元,以美元为美元为美元,以美元为美元,以美元为美元为美元为美元,以美元为美元为美元,以美元为美元为美元为美元为美元为美元为美元为美元,以美元,以美元,以美元为美元为美元为美元,以美元为美元为美元为美元为美元为美元计算的精确比为美元,以美元,以美元,以美元计算的精确度为美元,以美元为美元,以美元为美元为美元为美元,以美元,以美元为美元为美元,以美元为美元,以美元为美元为美元为美元,以美元为美元为美元为美元,以美元为美元为美元为美元为美元为美元,以美元为美元为美元为美元,以美元为美元为美元为美元为美元为美元为美元为美元,以美元计算的计算的计算的精确比的精确比的精确比为美元,以美元,比

0

相关内容

分解的

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

专知会员服务

17+阅读 · 2022年3月19日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

GJ的Ca2+传递引起钙稳态失衡诱导内质网应激在肝移植术后急性肾损伤中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

蛋白磷酸酶2A在NO供体诱导肝癌细胞凋亡中的调节作用

国家自然科学基金

0+阅读 · 2015年12月31日

神经酰胺调控Ca2+-ERS通路诱导涎腺腺样囊性癌细胞凋亡及其分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

粗糙核奇异积分算子的若干问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

OPG诱导破骨细胞凋亡的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

HIPK2在高糖介导足细胞损伤中调节机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去乙酰化酶抑制剂对骨关节炎中Notch-NFAT信号通路调控的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

GaN异质结中快重离子引起电离损伤的研究

国家自然科学基金

0+阅读 · 2009年12月31日

Learning Treatment Effects in Panels with General Intervention Patterns

Learning Treatment Effects in Panels with General Intervention Patterns

Arxiv

0+阅读 · 2023年3月31日

Large Dimensional Independent Component Analysis: Statistical Optimality and Computational Tractability

Large Dimensional Independent Component Analysis: Statistical Optimality and Computational Tractability

Arxiv

0+阅读 · 2023年3月31日

A data-driven method for parametric PDE Eigenvalue Problems using Gaussian Process with different covariance functions

Arxiv

0+阅读 · 2023年3月31日

Differentially Private Stochastic Convex Optimization in (Non)-Euclidean Space Revisited

Arxiv

0+阅读 · 2023年3月31日

Conflict-Averse Gradient Optimization of Ensembles for Effective Offline Model-Based Optimization

Arxiv

0+阅读 · 2023年3月31日

Bootstrapping multiple systems estimates to account for model selection

Arxiv

0+阅读 · 2023年3月31日

On Rényi Differential Privacy in Statistics-Based Synthetic Data Generation

Arxiv

0+阅读 · 2023年3月31日

Differentially Private Vertical Federated Clustering

Arxiv

0+阅读 · 2023年3月31日

Active Bayesian Causal Inference

Arxiv

14+阅读 · 2022年10月15日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

VIP会员

文章信息

相关主题

相关VIP内容

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

ICLR 2022杰出论文公布：7篇论文获得，清华朱军课题组摘得

专知会员服务

60+阅读 · 2022年4月22日

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

【CVPR 2022】基于本地正则化和稀疏化差分隐私的联邦学习，Differentially Private Federated Learning with Local Regularization and Sparsification

专知会员服务

17+阅读 · 2022年3月19日

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

【干货书】深度学习合成数据，354页pdf，Synthetic Data for Deep Learning

专知会员服务

104+阅读 · 2022年2月10日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《城市滨海地区：理解复杂多变环境下的指挥控制框架》50页报告

《理解城市战及其在俄乌战争中的表现》报告

美空军“顶点2025”实验：推进AI在C2、动态目标锁定与联盟集成中的应用

《建设式兵棋模拟作为战术集群配置优化的关键组成部分》

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【SIGIR2018】五篇对抗训练文章

【SIGIR2018】五篇对抗训练文章

专知

12+阅读 · 2018年7月9日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Learning Treatment Effects in Panels with General Intervention Patterns

Learning Treatment Effects in Panels with General Intervention Patterns

Arxiv

0+阅读 · 2023年3月31日

Large Dimensional Independent Component Analysis: Statistical Optimality and Computational Tractability

Large Dimensional Independent Component Analysis: Statistical Optimality and Computational Tractability

Arxiv

0+阅读 · 2023年3月31日

A data-driven method for parametric PDE Eigenvalue Problems using Gaussian Process with different covariance functions

Arxiv

0+阅读 · 2023年3月31日

Differentially Private Stochastic Convex Optimization in (Non)-Euclidean Space Revisited

Arxiv

0+阅读 · 2023年3月31日

Conflict-Averse Gradient Optimization of Ensembles for Effective Offline Model-Based Optimization

Arxiv

0+阅读 · 2023年3月31日

Bootstrapping multiple systems estimates to account for model selection

Arxiv

0+阅读 · 2023年3月31日

On Rényi Differential Privacy in Statistics-Based Synthetic Data Generation

Arxiv

0+阅读 · 2023年3月31日

Differentially Private Vertical Federated Clustering

Arxiv

0+阅读 · 2023年3月31日

Active Bayesian Causal Inference

Arxiv

14+阅读 · 2022年10月15日

Federated Causal Inference in Heterogeneous Observational Data

Arxiv

24+阅读 · 2021年8月10日

相关基金

GJ的Ca2+传递引起钙稳态失衡诱导内质网应激在肝移植术后急性肾损伤中的作用研究

国家自然科学基金

0+阅读 · 2015年12月31日

蛋白磷酸酶2A在NO供体诱导肝癌细胞凋亡中的调节作用

国家自然科学基金

0+阅读 · 2015年12月31日

神经酰胺调控Ca2+-ERS通路诱导涎腺腺样囊性癌细胞凋亡及其分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

粗糙核奇异积分算子的若干问题研究

国家自然科学基金

0+阅读 · 2013年12月31日

Calderon问题和边界刚性问题

国家自然科学基金

0+阅读 · 2013年12月31日

可压缩Navier-Stokes方程和Boltzmann方程解的渐近行为

国家自然科学基金

0+阅读 · 2013年12月31日

OPG诱导破骨细胞凋亡的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

HIPK2在高糖介导足细胞损伤中调节机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

组蛋白去乙酰化酶抑制剂对骨关节炎中Notch-NFAT信号通路调控的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

GaN异质结中快重离子引起电离损伤的研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员