The data management of large companies often prioritize more recent data, as a source of higher accuracy prediction than outdated data. For example, the Facebook data policy retains user search histories for $6$ months while the Google data retention policy states that browser information may be stored for up to $9$ months. These policies are captured by the sliding window model, in which only the most recent $W$ statistics form the underlying dataset. In this paper, we consider the problem of privately releasing the $L_2$-heavy hitters in the sliding window model, which include $L_p$-heavy hitters for $p\le 2$ and in some sense are the strongest possible guarantees that can be achieved using polylogarithmic space, but cannot be handled by existing techniques due to the sub-additivity of the $L_2$ norm. Moreover, existing non-private sliding window algorithms use the smooth histogram framework, which has high sensitivity. To overcome these barriers, we introduce the first differentially private algorithm for $L_2$-heavy hitters in the sliding window model by initiating a number of $L_2$-heavy hitter algorithms across the stream with significantly lower threshold. Similarly, we augment the algorithms with an approximate frequency tracking algorithm with significantly higher accuracy. We then use smooth sensitivity and statistical distance arguments to show that we can add noise proportional to an estimation of the $L_2$ norm. To the best of our knowledge, our techniques are the first to privately release statistics that are related to a sub-additive function in the sliding window model, and may be of independent interest to future differentially private algorithmic design in the sliding window model.
翻译:大公司的数据管理往往优先考虑较近期的数据,因为其准确性预测高于过时的数据。例如,Facebook数据政策保留用户搜索历史6个月,而Google数据保留政策则指出浏览器信息可能存储最多9个月。这些政策被滑动窗口模型所捕捉,其中只有最新的美元数据构成基本数据集。在本文中,我们考虑在滑动窗口模型中私下释放$L_2美元重击手的问题,其中包括$p\le 2$的美元-重击手,从某种意义上说,这是使用多logariphy空间可以实现的最有力的保证,但由于$L_2美元的规范的次增加性,无法用现有技术来处理。此外,现有的非私人滑动窗口算法使用平滑直方图框架,这种框架具有高度敏感性。要克服这些障碍,我们引入了在滑动窗口模型中首次使用美元-p$p$2美元-重重击击手法的私人算手法,通过启动美元-美元-L_2美元的快速度模型设计方法,我们使用一个更平稳的快速的轨算算法,我们可以用一个更低的运算算法来大幅地显示我们更精确的快速的离电算。