Generalized linear mixed models are powerful tools for analyzing clustered data, where the unknown parameters are classically (and most commonly) estimated by the maximum likelihood and restricted maximum likelihood procedures. However, since the likelihood based procedures are known to be highly sensitive to outliers, M-estimators have become popular as a means to obtain robust estimates under possible data contamination. In this paper, we prove that, for sufficiently smooth general loss functions defining the M-estimators in generalized linear mixed models, the tail probability of the deviation between the estimated and the true regression coefficients have an exponential bound. This implies an exponential rate of consistency of these M-estimators under appropriate assumptions, generalizing the existing exponential consistency results from univariate to multivariate responses. We have illustrated this theoretical result further for the special examples of the maximum likelihood estimator and the robust minimum density power divergence estimator, a popular example of model-based M-estimators, in the settings of linear and logistic mixed models, comparing it with the empirical rate of convergence through simulation studies.
翻译:广义线性混合模型是分析聚类数据的强大工具。其中未知参数通常通过最大似然估计和受限制最大似然估计过程来估计。然而,由于似然度估计程序已知对离群值非常敏感,因此M-估计已经成为一种在可能数据污染的条件下获得健壮估计的手段。在本文中,我们证明对于定义广义线性混合模型中M-估计的足够平滑的一般损失函数,估计和真实回归系数之间的偏差尾概率具有指数限制。这意味着在适当的假设下,这些M-估计具有指数收敛速度的一致性,将现有的单变量响应的指数一致性结果推广到多元响应。我们进一步以线性和逻辑混合模型的特殊例子为例,介绍了这一理论结果,比较了通过模拟研究获得的经验收敛速度与之。同时,并将最大似然估计和健壮最小密度功率分歧估计的特殊例子应用于此。