Models such as the zero-inflated and zero-altered Poisson and zero-truncated binomial are well-established in modern regression analysis. We propose a super model that jointly and maximally unifies alteration, inflation, truncation and deflation for counts, given a 1- or 2-parameter parent (base) distribution. Seven disjoint sets of special value types are accommodated because all but truncation have parametric and nonparametric variants. Some highlights include: (i) the mixture distribution is exceeding flexible, e.g., up to seven modes; (ii) under-, equi- and over-dispersion can be handled using a negative binomial (NB) parent, with underdispersion handled by a novel Generally-Truncated-Expansion method; (iii) overdispersion can be studied holistically in terms of the four operators; (iv) an important application: heaped and seeped data from retrospective self-reported surveys are readily handled, e.g., spikes and dips which are located virtually anywhere; (v) while generally-altered regression explains why observations are there, generally-inflated regression accounts for why they are there in excess, and generally-deflated regression explains why observations are not there; (vi) the VGAM R package implements the methodology based on Fisher scoring and multinomial logit model (Poisson, NB, zeta and logarithmic parents are implemented.) The GAITD-NB has potential to become the Swiss army knife of count distributions.
翻译:在现代回归分析中,零膨胀和零调整的Poisson和零调整的二进制二进制等模型已经完全确立。我们提出了一个超级模型,根据1或2参数的父(基)分布,对计数进行联合和最大一致的修改、通货膨胀、脱节和通缩。7套特殊值类型脱节,因为所有除脱节之外的特殊类型都有参数和非参数变量。一些亮点包括:(一) 混合物分布超灵活,例如,最高可达7种模式;(二) 低、等和超分的双向型(NB)母体,可联合和最大程度地统一调整、通货膨胀、脱节和通缩计数。 (三) 超分解可以整体地研究这4个操作者;(四) 一个重要应用程序:从追溯性自我报告调查中得出的模型化和可见的数据很容易被处理,例如,那里几乎位于任何地点的螺旋上升和缩缩略图; (五) 总体而言,螺旋式的螺旋式的螺旋折折缩方法可以解释为何在GA的折折中,而一般的螺旋式的螺旋的螺旋的螺旋的螺旋的螺旋在的螺旋。