与多级数据工具有区别的私人数据出版 (Differentially Private Data Publication with Multi-level Data Utility)

Conventional private data publication mechanisms aim to retain as much data utility as possible while ensuring sufficient privacy protection on sensitive data. Such data publication schemes implicitly assume that all data analysts and users have the same data access privilege levels. However, it is not applicable for the scenario that data users often have different levels of access to the same data, or different requirements of data utility. The multi-level privacy requirements for different authorization levels pose new challenges for private data publication. Traditional PPDP mechanisms only publish one perturbed and private data copy satisfying some privacy guarantee to provide relatively accurate analysis results. To find a good tradeoff between privacy preservation level and data utility itself is a hard problem, let alone achieving multi-level data utility on this basis. In this paper, we address this challenge in proposing a novel framework of data publication with compressive sensing supporting multi-level utility-privacy tradeoffs, which provides differential privacy. Specifically, we resort to compressive sensing (CS) method to project a $n$-dimensional vector representation of users' data to a lower $m$-dimensional space, and then add deliberately designed noise to satisfy differential privacy. Then, we selectively obfuscate the measurement vector under compressive sensing by adding linearly encoded noise, and provide different data reconstruction algorithms for users with different authorization levels. Extensive experimental results demonstrate that ML-DPCS yields multi-level of data utility for specific users at different authorization levels.

翻译：常规私营数据公布机制旨在尽可能保留数据效用,同时确保敏感数据的充分隐私保护;这类数据公布机制暗含地假定所有数据分析员和用户拥有相同的数据访问特权水平;然而,数据用户往往有不同程度的获取相同数据的机会,或对数据效用的不同要求,这种假设并不适用;不同授权水平的多层次隐私要求给私人数据出版带来了新的挑战;传统的PPDP机制只公布一个受干扰的和私人的数据副本,满足一些隐私保证,以提供相对准确的分析结果;要找到隐私保护水平与数据效用本身之间的良好平衡是一个棘手问题,更不用说在此基础上实现多层次的数据数据效用;在本文件中,我们应对这一挑战,提出一个新的数据出版框架,采用压缩的遥感,支持多层次的通用数据交换;具体地说,我们采用压缩的遥感(CS)方法,将用户的数据以一元为单位,向较低维度的空间提供比较准确的矢量代表,然后刻意地设计噪音以满足差异隐私权。

相关内容

压缩感知

关注 275

压缩感知是近年来极为热门的研究前沿，在若干应用领域中都引起瞩目。 compressive sensing（CS）又称 compressived sensing ，compressived sample，大意是在采集信号的时候（模拟到数字），同时完成对信号压缩之意。与稀疏表示不同，压缩感知关注的是如何利用信号本身所具有的稀疏性，从部分观测样本中恢复原信号。

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

CCF-A类顶会WWW2021论文结果出炉，357篇上榜！你的论文中了吗？

专知会员服务

47+阅读 · 2021年1月17日

IJCAI2020接受论文列表，592篇论文pdf都在这了！

专知会员服务

64+阅读 · 2020年7月16日

【北京大学】Locally Differentially Private (Contextual) Bandits Learning

专知会员服务

13+阅读 · 2020年6月8日