PMP: 时间序列关于敏感模式推断的隐私-软件矩阵概况 (PMP: Privacy-Aware Matrix Profile against Sensitive Pattern Inference for Time Series)

Recent rapid development of sensor technology has allowed massive fine-grained time series (TS) data to be collected and set the foundation for the development of data-driven services and applications. During the process, data sharing is often involved to allow the third-party modelers to perform specific time series data mining (TSDM) tasks based on the need of data owner. The high resolution of TS brings new challenges in protecting privacy. While meaningful information in high-resolution TS shifts from concrete point values to local shape-based segments, numerous research have found that long shape-based patterns could contain more sensitive information and may potentially be extracted and misused by a malicious third party. However, the privacy issue for TS patterns is surprisingly seldom explored in privacy-preserving literature. In this work, we consider a new privacy-preserving problem: preventing malicious inference on long shape-based patterns while preserving short segment information for the utility task performance. To mitigate the challenge, we investigate an alternative approach by sharing Matrix Profile (MP), which is a non-linear transformation of original data and a versatile data structure that supports many data mining tasks. We found that while MP can prevent concrete shape leakage, the canonical correlation in MP index can still reveal the location of sensitive long pattern. Based on this observation, we design two attacks named Location Attack and Entropy Attack to extract the pattern location from MP. To further protect MP from these two attacks, we propose a Privacy-Aware Matrix Profile (PMP) via perturbing the local correlation and breaking the canonical correlation in MP index vector. We evaluate our proposed PMP against baseline noise-adding methods through quantitative analysis and real-world case studies to show the effectiveness of the proposed method.

翻译：传感器技术的近期快速发展使得大量精细时间序列数据得以收集,并为数据驱动服务和应用的发展奠定了基础。在这一过程中,数据共享经常涉及让第三方建模者根据数据拥有者的需求执行特定的时间序列数据挖掘任务。TS的高分辨率在保护隐私方面带来了新的挑战。虽然高分辨率TS中有意义的信息从具体点值转移到基于形状的局部部分,但许多研究发现,基于形状的长式模式可能包含更敏感的信息,并有可能被恶意第三方提取和滥用。然而,在保存隐私的文献中,令人惊讶地很少探讨TS模式的隐私相关性问题。在这项工作中,我们考虑一个新的隐私保护问题:防止对基于形状的长式模式的恶意推断,同时保留用于实用性任务业绩的短段信息。为了减轻这一挑战,我们通过共享矩阵的矢量剖图(MP)来研究另一种替代方法,这是对原始数据的非线性转换,以及一个支持许多数据挖掘任务的多功能结构。我们发现,虽然MPA的隐私关联性关系在保存隐私文献的文献中很少探讨。我们可以通过移动的定位模型来分析,但是,我们也可以在移动的精确定位上显示我们的目标定位定位的路径上,我们也可以在移动的路径上显示,我们通过移动攻击的精确定位的路径上进行。