With mobile phone penetration rates reaching 90%, Consumer Proprietary Network Information (CPNI) can offer extremely valuable information to different sectors, including policymakers. Indeed, as part of CPNI, Call Detail Records have been successfully used to provide real-time traffic information, to improve our understanding of the dynamics of people's mobility and so to allow prevention and measures in fighting infectious diseases, and to offer population statistics. While there is no doubt of the usefulness of CPNI data, privacy concerns regarding sharing individuals' data have prevented it from being used to its full potential. Traditional de-anonymization measures, such as pseudonymization and standard de-identification, have been shown to be insufficient to protect privacy. This has been specifically shown on mobile phone datasets. As an example, researchers have shown that with only four data points of approximate place and time information of a user, 95% of users could be re-identified in a dataset of 1.5 million mobile phone users. In this landscape paper, we will discuss the state-of-the-art anonymization techniques and their shortcomings.
翻译:随着移动电话普及率达到90%,消费者产权网络信息(CPNI)可以向不同部门,包括决策者提供极有价值的信息。事实上,作为CPNI的一部分,Call Retal Record已被成功地用于提供实时交通信息,以增进我们对人口流动性动态的了解,从而在防治传染病方面采取预防和措施,并提供人口统计数据。毫无疑问,NNI数据的有用性在分享个人数据方面的隐私关切使个人数据无法充分利用其潜力。传统的匿名化措施,如假名化和标准身份识别,已经证明不足以保护隐私。这在移动电话数据集中得到了具体显示。例如,研究人员已经表明,只有四个拥有用户近似地点和时间信息的数据点,95%的用户可以在150万移动电话用户的数据集中重新确认。在这个景观文件中,我们将讨论最先进的匿名化技术及其缺点。