Making informed decisions about model adequacy has been an outstanding issue for regression models with discrete outcomes. Standard assessment tools for such outcomes (e.g. deviance residuals) often show a large discrepancy from the hypothesized pattern even under the true model and are not informative especially when data are highly discrete (e.g. binary). To fill this gap, we propose a quasi-empirical residual distribution function for general discrete (e.g. ordinal and count) outcomes that serves as an alternative to the empirical Cox-Snell residual distribution function. The assessment tool we propose is a principled approach and does not require injecting noise into the data. When at least one continuous covariate is available, we show asymptotically that the proposed function converges uniformly to the identity function under the correctly specified model, even with highly discrete outcomes. Through simulation studies, we demonstrate empirically that the proposed quasi-empirical residual distribution function outperforms commonly used residuals for various model assessment tasks, since it is close to the hypothesized pattern under the true model and significantly departs from this pattern under model misspecification, and is thus an effective assessment tool.
翻译:就模型适足性作出知情决定一直是具有离散结果的回归模型的一个未决问题。这类结果的标准评估工具(如偏离常规残留物)往往显示与假设规模模式有很大差异,即使在真实模型下也是如此,而且缺乏信息,特别是在数据高度离散(如二进制)的情况下。为填补这一差距,我们提议为一般离散(如交点和计数)结果设定一个准经验性剩余分布功能,作为经验性Cox-Snell剩余分布功能的替代。我们提议的评估工具是一种原则性方法,不需要在数据中注入噪音。当至少有一个连续的共变法可用时,我们不小心地表明拟议的功能与正确指定的模型下的身份功能一致,即使有高度离散的结果。通过模拟研究,我们从经验上表明,拟议的准精神剩余分布功能超越了各种模型评估任务通常使用的剩余物,因为它接近于真实模型下的虚构模式,因此与模型中的一种模式有很大差异,因此是一种有效的评估工具。