In this work, we design differentially private hypothesis tests for the following problems in the general linear model: testing a linear relationship and testing for the presence of mixtures. The majority of our hypothesis tests are based on differentially private versions of the $F$-statistic for the general linear model framework, which are uniformly most powerful unbiased in the non-private setting. We also present other tests for these problems, one of which is based on the differentially private nonparametric tests of Couch, Kazan, Shi, Bray, and Groce (CCS 2019), which is especially suited for the small dataset regime. We show that the differentially private $F$-statistic converges to the asymptotic distribution of its non-private counterpart. As a corollary, the statistical power of the differentially private $F$-statistic converges to the statistical power of the non-private $F$-statistic. Through a suite of Monte Carlo based experiments, we show that our tests achieve desired significance levels and have a high power that approaches the power of the non-private tests as we increase sample sizes or the privacy-loss parameter. We also show when our tests outperform existing methods in the literature.
翻译:在这项工作中,我们为一般线性模型中的下列问题设计了不同的私人假设测试:测试线性关系和测试混合物的存在;我们大多数假设测试都以一般线性模型框架的美元-统计的不同私人版本为基础,一般线性模型框架的美元-统计格式在非私人环境下统一最强的不偏袒;我们还为这些问题提出了其他测试,其中之一是基于Couch、Kazan、Shi、Bray和Groce(CCS 2019)的差别私人非参数测试,特别适合小型数据集制度;我们表明,差异性私人美元-统计式测试与非私人对应方的无特征分布相融合;作为必然结果,差异私人美元-统计式的统计能力与非私人美元-统计式的统计能力相融合;我们通过基于Monte Carlo的成套实验,表明我们的测试达到了理想的意义水平,并且具有很高的力量,随着我们增加样本大小或隐私损失参数而接近非私人测试的力量。我们还在文献中展示了现有方法。