We study estimation and testing in the Poisson regression model with noisy high dimensional covariates, which has wide applications in analyzing noisy big data. Correcting for the estimation bias due to the covariate noise leads to a non-convex target function to minimize. Treating the high dimensional issue further leads us to augment an amenable penalty term to the target function. We propose to estimate the regression parameter through minimizing the penalized target function. We derive the L1 and L2 convergence rates of the estimator and prove the variable selection consistency. We further establish the asymptotic normality of any subset of the parameters, where the subset can have infinitely many components as long as its cardinality grows sufficiently slow. We develop Wald and score tests based on the asymptotic normality of the estimator, which permits testing of linear functions of the members if the subset. We examine the finite sample performance of the proposed tests by extensive simulation. Finally, the proposed method is successfully applied to the Alzheimer's Disease Neuroimaging Initiative study, which motivated this work initially.
翻译:我们用噪音高维共变体研究Poisson回归模型的估算和测试,该模型在分析吵闹的大数据方面有着广泛的应用。纠正由于共变噪音造成的估计偏差,导致一个非convex目标功能最小化。处理高维问题进一步导致我们增加一个可调整的惩罚期,以达到目标功能。我们提议通过尽量减少受罚目标功能来估计回归参数。我们从测算仪的L1和L2趋同率中得出测深率,并证明可变选择的一致性。我们进一步确定该子子参数的任何子参数的无症状常度,只要其基点发展得足够慢,该子可以具有无限的多个组成部分。我们根据估计器的无孔正常度进行Wald和计分测试,允许在子进行时对成员线性功能进行测试。我们通过广泛的模拟来审查拟议测试的有限样本性能。最后,拟议方法成功地应用于最初推动这项工作的阿尔茨海默氏病神经成像倡议研究。