Spontaneous reporting systems (SRS) have been developed to collect adverse event records that contain personal demographics and sensitive information like drug indications and adverse reactions. The release of SRS data may disclose the privacy of the data provider. Unlike other microdata, very few anonymyization methods have been proposed to protect individual privacy while publishing SRS data. MS(k, {\theta}*)-bounding is the first privacy model for SRS data that considers multiple individual records, mutli-valued sensitive attributes, and rare events. PPMS(k, {\theta}*)-bounding then is proposed for solving cross-release attacks caused by the follow-up cases in the periodical SRS releasing scenario. A recent trend of microdata anonymization combines the traditional syntactic model and differential privacy, fusing the advantages of both models to yield a better privacy protection method. This paper proposes the PPMS-DP(k, {\theta}*, {\epsilon}) framework, an enhancement of PPMS(k, {\theta}*)-bounding that embraces differential privacy to improve privacy protection of periodically released SRS data. We propose two anonymization algorithms conforming to the PPMS-DP(k, {\theta}*, {\epsilon}) framework, PPMS-DPnum and PPMS-DPall. Experimental results on the FAERS datasets show that both PPMS-DPnum and PPMS-DPall provide significantly better privacy protection than PPMS-(k, {\theta}*)-bounding without sacrificing data distortion and data utility.
翻译:已经开发了Spontane报告系统(SRS),以收集包含个人人口和敏感信息(如药物迹象和不良反应)的不良事件记录。发布SRS数据可能披露数据提供者的隐私。与其他微观数据不同,很少有匿名方法被提出来保护个人隐私,同时发布SRS数据。MS(k, ~theta ⁇ )是SRS数据的第一个隐私模式,其中考虑到多个个人记录、粘土价值敏感属性和稀有事件。PPMS(k, ~theta ⁇ )访问(然后建议解决SRS数据发布情景中的后续案例造成的交叉释放攻击。最近微数据匿名化的趋势将传统的合成模型和差异隐私结合起来,同时利用两种模型的优势来产生更好的隐私保护方法。本文提议采用PPMS-DP(k, ~thethetli值敏感特性)框架,加强PPMS(k, ~thethe the us-treal-DP)框架,WMS-wear-tal-DP)在不包含不同隐私的SLisal MS数据保护, IMS-salalalalal IMS