Statistical inference in the presence of nuisance functionals with complex survey data is an important topic in social and economic studies. The Gini index, Lorenz curves and quantile shares are among the commonly encountered examples. The nuisance functionals are usually handled by a plug-in nonparametric estimator and the main inferential procedure can be carried out through a two-step generalized empirical likelihood method. Unfortunately, the resulting inference is not efficient and the nonparametric version of the Wilks' theorem breaks down even under simple random sampling. We propose an augmented estimating equations method with nuisance functionals and complex surveys. The second-step augmented estimating functions obey the Neyman orthogonality condition and automatically handle the impact of the first-step plug-in estimator, and the resulting estimator of the main parameters of interest is invariant to the first step method. More importantly, the generalized empirical likelihood based Wilks' theorem holds for the main parameters of interest under the design-based framework for commonly used survey designs, and the maximum generalized empirical likelihood estimators achieve the semiparametric efficiency bound. Performances of the proposed methods are demonstrated through simulation studies and an application using the dataset from the New York City Social Indicators Survey.
翻译:具有复杂调查数据的骚扰功能的统计推论是社会和经济研究的一个重要专题。吉尼指数、洛伦茨曲线和四分位数共享是常见的例子之一。骚扰功能通常由插头非对称估测器处理,主要的推论程序可以通过两步普遍经验概率法进行。不幸的是,由此得出的推论效率不高,即使通过简单的随机抽样,威尔克斯的正角也断裂了非对称版本。我们建议一种扩大估计方程方法,加上干扰功能和复杂的调查。第二步扩大估计功能符合奈曼或多位性条件,自动处理第一步超位估测器的影响,由此产生的主要利益参数的估测器与第一步方法不尽相同。更重要的是,基于威尔克斯的理论模型为通用调查设计框架的主要利益参数保留了普遍的经验概率。通过模拟分析结果,通过模拟分析结果,通过模拟结果,通过模拟方法,通过模拟分析,得出了最大的普遍经验概率。