Motivated by the simultaneous association analysis with the presence of latent confounders, this paper studies the large-scale hypothesis testing problem for the high-dimensional confounded linear models with both non-asymptotic and asymptotic false discovery control. Such model covers a wide range of practical settings where both the response and the predictors may be confounded. In the presence of the high-dimensional predictors and the unobservable confounders, the simultaneous inference with provable guarantees becomes highly challenging, and the unknown strong dependency among the confounded covariates makes the challenge even more pronounced. This paper first introduces a decorrelating procedure that shrinks the confounding effect and weakens the correlations among the predictors, then performs debiasing under the decorrelated design based on some biased initial estimator. Standardized test statistics are then constructed and the corresponding asymptotic normality property is established. Furthermore, a simultaneous inference procedure is proposed to identify significant associations, and both the finite-sample and asymptotic false discovery bounds are provided. The non-asymptotic result is general and model-free, and is of independent interest. We also prove that, under minimal signal strength condition, all associations can be successfully detected with probability tending to one. Simulation studies are carried out to evaluate the performance of the proposed approach and compare it with other competing methods. The proposed procedure is further applied to detect the gene associations with the anti-cancer drug sensitivities.
翻译:本文在与潜在混淆者同时进行的关联分析的推动下,研究与潜在混淆者同时进行的关联分析,研究高维混凝土线性模型的大规模假设测试问题,发现高维混凝土线性模型的大规模假设测试问题,该模型既具有非消毒效应,又具有非消毒的虚假发现控制,这种模型涵盖各种实际环境,其中反应和预测者都可能相互混淆。在高维预测者和不易观察的混淆者在场的情况下,与可辨识的保证同时进行的推论变得极具挑战性,而同源共变的共变的共变种性之间又具有未知的强烈依赖性,使得挑战更加明显。本文首先引入了一种装饰性程序,该程序缩小了混杂效应,削弱了预测者之间的关联,从而削弱了预测者之间的关联,然后根据某些偏差的初步估计,在与设计相关的设计下,在设计下,在设计标准化的测试统计数据中,并建立了相应的防腐蚀性常态特性特性。此外,还提出了一种同时推论方法,即进一步采用定型和反毒的基因发现。