Correlated binary response data with covariates are ubiquitous in longitudinal or spatial studies. Among the existing statistical models the most well-known one for this type of data is the multivariate probit model, which uses a Gaussian link to model dependence at the latent level. However, a symmetric link may not be appropriate if the data are highly imbalanced. Here, we propose a multivariate skew-elliptical link model for correlated binary responses, which includes the multivariate probit model as a special case. Furthermore, we perform Bayesian inference for this new model and prove that the regression coefficients have a closed-form unified skew-elliptical posterior. The new methodology is illustrated by application to COVID-19 pandemic data from three different counties of the state of California, USA. By jointly modeling extreme spikes in weekly new cases, our results show that the spatial dependence cannot be neglected. Furthermore, the results also show that the skewed latent structure of our proposed model improves the flexibility of the multivariate probit model and provides better fit to our highly imbalanced dataset.
翻译:在纵向或空间研究中,与共变相相关的二进制反应数据是普遍存在的。在现有的统计模型中,这类数据最著名的是多变式正方位模型,它使用高斯与潜层的模型依赖关系。然而,如果数据高度不平衡,则对称联系可能不合适。在这里,我们建议了相关二进制反应的多变式正方位-异方位链接模式,其中包括多变性正方位模型,作为特例。此外,我们对这一新模型进行巴伊西亚推论,并证明回归系数具有封闭式的统一正方位螺旋-椭圆形远方位模型。新的方法通过美国加利福尼亚州三个州对COVID-19大流行数据的应用加以说明。通过在每周新案例中联合模拟极端上升,我们的结果表明空间依赖性不容忽视。此外,结果还表明,我们提议的模型的扭曲潜伏性潜伏结构提高了多变方位正位正位模型的灵活性,更适合我们高度失衡的数据。