Spatial generalized linear mixed models (SGLMMs) are popular and flexible models for non-Gaussian spatial data. They are useful for spatial interpolations as well as for fitting regression models that account for spatial dependence, and are commonly used in many disciplines such as epidemiology, atmospheric science, and sociology. Inference for SGLMMs is typically carried out under the Bayesian framework at least in part because computational issues make maximum likelihood estimation challenging, especially when high-dimensional spatial data are involved. Here we provide a computationally efficient projection-based maximum likelihood approach and two computationally efficient algorithms for routinely fitting SGLMMs. The two algorithms proposed are both variants of expectation maximization algorithm, using either Markov chain Monte Carlo or a Laplace approximation for the conditional expectation. Our methodology is general and applies to both discrete-domain (Gaussian Markov random field) as well as continuous-domain (Gaussian process) spatial models. We show, via simulation and real data applications, that our methods perform well both in terms of parameter estimation as well as prediction. Crucially, our methodology is computationally efficient and scales well with the size of the data and is applicable to problems where maximum likelihood estimation was previously infeasible.
翻译:空间通用线性混合模型(SGLMMs)是非古日空间数据流行和灵活的模型,可用于空间间推和适合反映空间依赖性的回归模型,并用于流行病学、大气科学、社会学等许多学科,对古日光线性混合模型的推论通常至少部分地在巴伊西亚框架内进行,因为计算问题使得最有可能估计具有挑战性,特别是在涉及高维空间数据的情况下。我们在这里提供了一种基于预测的高效最大可能性预测法和两种用于定期安装SGLMMs的高效计算算法。提出的两种算法都是预期最大化算法的变式,使用马尔科夫链蒙特卡洛或拉普尔近似法作为有条件的预期。我们的方法很笼统,适用于离散区(加乌西南马尔科夫随机场)和连续区(加西南进程)空间模型。我们通过模拟和真实数据应用,表明我们的方法在参数估计和预测两方面都运作良好。关键的是,我们的方法在以往的尺度和可适用度上,在可计算的可能性和可应用的尺度上都存在问题。