Guaranteeing privacy in released data is an important goal for data-producing agencies. There has been extensive research on developing suitable privacy mechanisms in recent years. Particularly notable is the idea of noise addition with the guarantee of differential privacy. There are, however, concerns about compromising data utility when very stringent privacy mechanisms are applied. Such compromises can be quite stark in correlated data, such as time series data. Adding white noise to a stochastic process may significantly change the correlation structure, a facet of the process that is essential to optimal prediction. We propose the use of all-pass filtering as a privacy mechanism for regularly sampled time series data, showing that this procedure preserves utility while also providing sufficient privacy guarantees to entity-level time series.
翻译:保证发布数据的隐私是数据制作机构的一个重要目标。近年来,对开发适当的隐私机制进行了广泛的研究,特别值得注意的是,在保障有差别的隐私的情况下增加噪音的想法。然而,在适用非常严格的隐私机制时,人们担心会损害数据的实用性。这种妥协在相关数据(如时间序列数据)中可能非常明显。在随机进程中增加白色噪音可能会大大改变相关性结构,而这种结构是最佳预测所必不可少的过程的一个方面。我们提议使用“全面通行过滤”作为定期抽样的时间序列数据的隐私机制,表明这一程序在为实体一级的时间序列提供足够的隐私保障的同时,也能保持其实用性。