Preferential sampling has attracted considerable attention in geostatistics since the pioneering work of Diggle et al. (2010). A variety of likelihood-based approaches have been developed to correct estimation bias by explicitly modelling the sampling mechanism. While effective in many applications, these methods are often computationally expensive and can be susceptible to model misspecification. In this paper, we present a surprising finding: some existing non-likelihood-based methods that ignore preferential sampling can still produce unbiased and consistent estimators under the widely used framework of Diggle et al. (2010) and its extensions. We investigate the conditions under which preferential sampling can be ignored and develop relevant estimators for both regression and covariance parameters without specifying the sampling mechanism parametrically. Simulation studies demonstrate clear advantages of our approach, including reduced estimation error, improved confidence interval coverage, and substantially lower computational cost. To show the practical utility, we further apply it to a tropical forest data set.
翻译:自Diggle等人(2010)的开创性工作以来,偏好抽样在地统计学领域引起了广泛关注。为校正估计偏差,多种基于似然的方法通过显式建模抽样机制得以发展。尽管这些方法在许多应用中有效,但通常计算成本高昂,且易受模型设定错误的影响。本文提出一个令人惊讶的发现:在Diggle等人(2010)的广泛使用框架及其扩展下,某些忽略偏好抽样的现有非似然方法仍能产生无偏且一致的估计量。我们探究了偏好抽样可被忽略的条件,并为回归参数和协方差参数开发了不依赖参数化抽样机制的相关估计量。模拟研究展示了我们方法的明显优势,包括降低估计误差、提高置信区间覆盖率以及显著减少计算成本。为体现其实用性,我们进一步将其应用于热带森林数据集。