It is of soaring demand to develop statistical analysis tools that are robust against contamination as well as preserving individual data owners' privacy. In spite of the fact that both topics host a rich body of literature, to the best of our knowledge, we are the first to systematically study the connections between the optimality under Huber's contamination model and the local differential privacy (LDP) constraints. In this paper, we start with a general minimax lower bound result, which disentangles the costs of being robust against Huber's contamination and preserving LDP. We further study four concrete examples: a two-point testing problem, a potentially-diverging mean estimation problem, a nonparametric density estimation problem and a univariate median estimation problem. For each problem, we demonstrate procedures that are optimal in the presence of both contamination and LDP constraints, comment on the connections with the state-of-the-art methods that are only studied under either contamination or privacy constraints, and unveil the connections between robustness and LDP via partially answering whether LDP procedures are robust and whether robust procedures can be efficiently privatised. Overall, our work showcases a promising prospect of joint study for robustness and local differential privacy.
翻译:尽管这两个专题都蕴藏着丰富的文献,但据我们所知,我们是首先系统地研究Huber污染模式下的最佳性和当地差异隐私限制之间的最佳联系。在本文件中,我们首先从一般小型下限结果着手,分解对Huber污染和保存LDP的稳健性的成本。我们进一步研究了四个具体例子:两点测试问题、潜在分散的中间估计问题、非对称密度估计问题和单向中位估计问题。对于每一个问题,我们展示了在存在污染和LDP限制的情况下最理想的程序,评论了仅受污染或隐私限制而研究的与最新方法的联系,并通过部分回答LDP程序是否稳健性以及能否高效地利用稳健性程序来揭示稳健性与LDP之间的联系。总体而言,我们的工作展示了对稳健性和地方差异进行联合研究的前景。