Although models for count data with over-dispersion have been widely considered in the literature, models for under-dispersion -- the opposite phenomenon -- have received less attention as it is only relatively common in particular research fields such as biodosimetry and ecology. The Good distribution is a flexible alternative for modelling count data showing either over-dispersion or under-dispersion, although no R packages are still available to the best of our knowledge. We aim to present in the following the R package good that computes the standard probabilistic functions (i.e., probability density function, cumulative distribution function, and quantile function) and generates random samples from a population following a Good distribution. The package also considers a function for Good regression, including covariates in a similar way to that of the standard glm function. We finally show the use of such a package with some real-world data examples addressing both over-dispersion and especially under-dispersion.
翻译:虽然文献中已广泛考虑过多分散的计数数据模型,但差分过低 -- -- 反现象 -- -- 模型得到的关注较少,因为在生物多度测量和生态学等特定研究领域,这种模型只是相对常见的。良好的分布是模拟计数数据的灵活替代方法,显示过分散或差分不足,尽管我们所了解的目前还不具备任何R包。我们的目标是在以下R包中提供计算标准概率函数(即概率密度函数、累积分布函数和定量函数)的R包件,并在良好分布后从人群中生成随机样本。该包件还考虑了良好回归的功能,包括以与标准 glm 函数相似的方式进行共变。我们最后用一些真实世界的数据实例来显示这种包的使用,其中既涉及多分散性,又特别涉及下分散性。