Location-based services (LBS) are increasingly used in recent years, and consequently a large amount of location traces are accumulating in a data center. Although these traces can be provided to a data analyst for geo-data analysis, the disclosure of location traces raises serious privacy concerns. Finding an appropriate anonymization method for location traces is also extremely challenging, especially for long traces. To address this issue, we have designed and held a location trace anonymization contest that deals with a long trace (400 events per user) and fine-grained locations (1024 regions). In our contest, each team anonymizes her original traces, and then the other teams perform privacy attacks against the anonymized traces (i.e., both defense and attack compete together) in a partial-knowledge attacker model where the adversary does not know the original traces. To realize such a contest, we propose a novel location synthesizer that has diversity in that synthetic traces for each team are different from those for the other teams and utility in that synthetic traces preserve various statistical features of real traces. We also show that re-identification alone is insufficient as a privacy risk, and that trace inference should be added as an additional risk. Specifically, we show an example of anonymization that is perfectly secure against re-identification and is not secure against trace inference. Based on this, our contest evaluates both the re-identification risk and trace inference risk, and analyzes the relation between the two risks. In this paper, we present our location synthesizer and the design of our contest, and then report our contest results.
翻译:近些年来越来越多地使用基于位置的服务(LBS),因此,大量位置痕迹正在数据中心中积累。虽然这些痕迹可以提供给数据分析师进行地理数据分析,但公布位置痕迹会引起严重的隐私问题。为位置痕迹寻找适当的匿名方法也极具挑战性,特别是长期痕迹。为了解决这一问题,我们设计并举办了一个位置追踪匿名竞赛,涉及长痕(每个用户400个事件)和细斑地点(1024个区域),在我们的竞争中,每个团队将她的原始痕迹匿名化,然后其他团队在部分知识攻击者模型中对匿名痕迹(即国防和攻击共同竞争)进行隐私攻击,而对手并不了解原始痕迹。为了实现这种竞争,我们建议建立一个新的位置合成特征合成特征,每个团队的合成痕迹不同于其他团队,合成痕迹保存各种真实痕迹的统计特征(1024个区域)。在我们的竞赛中,我们还表明仅重新识别身份是不够的作为隐私风险的隐私风险,而其他团队的隐私风险(即国防和攻击同时进行竞争)在部分知识攻击者模型中进行隐隐隐隐性攻击(即进行竞争),而精确地显示我们的历史定位设计。