Model selection criteria are rules used to select the best statistical model among a set of candidate models, striking a trade-off between goodness of fit and model complexity. Most popular model selection criteria measure the goodness of fit trough the model log-likelihood function, yielding to non-robust criteria. This paper presents a new family of robust model selection criteria for independent but not identically distributed observations (i.n.i.d.o.) based on the R\'enyi's pseudodistance (RP). The RP-based model selection criterion is indexed with a tuning parameter $\alpha$ controlling the trade-off between efficiency and robustness. Some theoretical results about the RP criterion are derived and the theory is applied to the multiple linear regression model, obtaining explicit expressions of the model selection criterion. Moreover, restricted models are considered and explicit expressions under the multiple linear regression model with nested models are accordingly derived. Finally, a simulation study empirically illustrates the robustness advantage of the method.
翻译:模型选择准则是用于在候选模型集合中选择最佳统计模型的规则,平衡拟合程度和模型复杂度之间的权衡。大多数流行的模型选择准则通过模型对数似然函数来度量拟合程度,从而得出非鲁棒的准则。本文提出了一种基于Rényi伪距离(RP)的用于独立非同分布观察(i.n.i.d.o.)的鲁棒模型选择准则新系列。RP-based模型选择标准由调节参数$\alpha$进行索引,控制效率和鲁棒性之间的权衡。推导了一些关于RP准则的理论结果,并将理论应用于多元线性回归模型,获得了模型选择准则的显式表达式。此外,考虑了限制模型,并相应地在多元线性回归模型下使用嵌套模型推导了显式表达式。最后,通过模拟研究,实证说明了该方法的鲁棒性优势。