Non-probability sampling is prevailing in survey sampling, but ignoring its selection bias leads to erroneous inferences. We offer a unified nonparametric calibration method to estimate the sampling weights for a non-probability sample by calibrating functions of auxiliary variables in a reproducing kernel Hilbert space. The consistency and the limiting distribution of the proposed estimator are established, and the corresponding variance estimator is also investigated. Compared with existing works, the proposed method is more robust since no parametric assumption is made for the selection mechanism of the non-probability sample. Numerical results demonstrate that the proposed method outperforms its competitors, especially when the model is misspecified. The proposed method is applied to analyze the average total cholesterol of Korean citizens based on a non-probability sample from the National Health Insurance Sharing Service and a reference probability sample from the Korea National Health and Nutrition Examination Survey.
翻译:在调查抽样中普遍存在非概率抽样,但忽视其选择偏好会导致错误的推断。我们提供了一个统一的非参数校准方法,通过在复制的内核Hilbert空间校准辅助变量的功能来估计非概率抽样的抽样权重。确定了拟议估算器的一致性和有限分布,并对相应的差异估计器进行了调查。与现有工程相比,拟议方法更为健全,因为没有为非概率抽样的挑选机制作出参数假设。数字结果表明,拟议方法优于竞争者,特别是在模型定义错误的情况下。拟议方法用于分析韩国公民的平均胆固醇总量,依据是国家健康保险共享服务局的非概率抽样,以及韩国国家健康和营养调查的参考概率样本。