As mobile devices and location-based services are increasingly developed in different smart city scenarios and applications, many unexpected privacy leakages have arisen due to geolocated data collection and sharing. User re-identification and other sensitive inferences are major privacy threats when geolocated data are shared with cloud-assisted applications. Significantly, four spatio-temporal points are enough to uniquely identify 95\% of the individuals, which exacerbates personal information leakages. To tackle malicious purposes such as user re-identification, we propose an LSTM-based adversarial mechanism with representation learning to attain a privacy-preserving feature representation of the original geolocated data (i.e., mobility data) for a sharing purpose. These representations aim to maximally reduce the chance of user re-identification and full data reconstruction with a minimal utility budget (i.e., loss). We train the mechanism by quantifying privacy-utility trade-off of mobility datasets in terms of trajectory reconstruction risk, user re-identification risk, and mobility predictability. We report an exploratory analysis that enables the user to assess this trade-off with a specific loss function and its weight parameters. The extensive comparison results on four representative mobility datasets demonstrate the superiority of our proposed architecture in mobility privacy protection and the efficiency of the proposed privacy-preserving features extractor. We show that the privacy of mobility traces attains decent protection at the cost of marginal mobility utility. Our results also show that by exploring the Pareto optimal setting, we can simultaneously increase both privacy (45%) and utility (32%).
翻译:由于移动装置和基于地点的服务在不同智能城市的情景和应用中日益发展,由于地理定位数据收集和共享,出现了许多意外的隐私渗漏。用户重新识别和其他敏感推断是在与云助应用程序共享地理定位数据时的重大隐私威胁。重要的是,四个时空点足以独特地识别个人95 ⁇,这加剧了个人信息渗漏。为了解决用户再识别等恶意目的,我们提议一个基于LSTM的对抗机制,以代表身份学习实现原始地理定位数据(即流动数据)的隐私保留特征,以共享目的。这些表述旨在最大限度地减少用户重新定位和全面数据重建的机会,同时提供最低限度的公用事业预算(即损失)。我们通过量化隐私和移动数据集的隐私交易,通过轨迹重建风险、用户再识别风险和流动性的可预测性,我们报告一项探索性分析,使用户能够以具体损失功能及其重量参数来评估这一交易的特征(即流动数据)。我们广泛进行比较的目的是最大限度地减少用户重新定位和完全重数据重组的机会,以最低用途预算(即损失)的流动性结构显示我们的拟议流动性保护效率,从而展示了我们的拟议流动性结构的流动性的稳定性,从而展示了我们拟议的流动性保护。