This paper proposes multivariate copula models for hierarchical data. They account for two types of correlation: one is between variables measured on the same unit and the other is a correlation between units in the same cluster. This model is used to carry out copula regression for hierarchical data that gives cluster specific prediction curves. In the simple case where a cluster contains two units and where two variables are measured on each one, the new model is constructed within a D-vine. Then we focus on situations where two variables are measured on the units of a cluster of arbitrary size. The proposed copula density has an explicit form; it is expressed in terms of three copula families. We study the properties of the model; compare it to the linear mixed model and end with special cases. When the three copula families and the marginal distributions are normal, the model is equivalent to a normal linear mixed model with random, cluster specific, intercepts. The method to select the three copula families and to estimate their parameters are proposed. We perform a Monte Carlo study of the parameter estimators. A data set on the marks of students in several school is used to implement the proposed model and to compare its performance to standard normal mixed linear models.
翻译:本文提出了一种基于顺序模型的异方差数据处理方法。该方法可以在数据中考虑协变量的影响,使得建立的模型更为精确。我们给出了基于随机游走的参数估计与检验方法。在实验中,我们通过对学生数学成绩的分析来说明了该方法的有效性,同时与其他现有的建模方法进行比较分析。此外,还给出了对此类建模方法的一些展望。
(Translated title: A new sequential model for heteroscedastic data)
(Translated abstract:
This paper proposes a method for processing heteroscedastic data based on a sequential model. This method can consider the impact of covariates in the data, making the established model more accurate. We provide parameter estimation and testing methods based on random walks. In the experiment, we demonstrate the effectiveness of the method by analyzing the math scores of students and compare it with other existing modeling methods. In addition, we provide some prospects for such modeling methods.)