Nonresponse is a common problem in survey sampling. Appropriate treatment can be challenging, especially when dealing with detailed breakdowns of totals. Often, the nearest neighbor imputation method is used to handle such incomplete multinomial data. In this article, we investigate the nearest neighbor ratio imputation estimator, in which auxiliary variables are used to identify the closest donor and the vector of proportions from the donor is applied to the total of the recipient to implement ratio imputation. To estimate the asymptotic variance, we first treat the nearest neighbor ratio imputation as a special case of predictive matching imputation and apply the linearization method of \cite{yang2020asymptotic}. To account for the non-negligible sampling fractions, parametric and generalized additive models are employed to incorporate the smoothness of the imputation estimator, which results in a valid variance estimator. We apply the proposed method to estimate expenditures detail items based on empirical data from the 2018 collection of the Service Annual Survey, conducted by the United States Census Bureau. Our simulation results demonstrate the validity of our proposed estimators and also confirm that the derived variance estimators have good performance even when the sampling fraction is non-negligible.
翻译:在抽样调查中,一个常见的问题是没有答复。适当的处理可能具有挑战性,特别是在处理总和的详细细目时。通常,使用最近的近邻估算法来处理这种不完整的多数值数据。在本篇文章中,我们调查最近的邻居比率估算估计估计器,其中使用辅助变量确定最接近的捐助者,对接受者总数适用捐助者比例的矢量,以实施估算率。在估算无症状差异时,我们首先将最近的邻居比率估算器作为预测匹配估算的特例,并应用了\cite{Yang20202020-symptisty}的线性化方法。为了说明非忽略的抽样分数、参数和通用添加模型,以纳入估算器的平稳性,从而得出有效的差异估计器。我们采用拟议方法,根据2018年美国人口普查局进行的服务年度调查收集的经验性数据估算支出细目。我们的模拟结果显示,我们提议的估算器的精确度为测算结果,即使测算器的误差也是准确的。