The paper considers parameter estimation in count data models using penalized likelihood methods. The motivating data consists of multiple independent count variables with a moderate sample size per variable. The data were collected during the assessment of oral reading fluency (ORF) in school-aged children. A sample of fourth-grade students were given one of ten available passages to read with these differing in length and difficulty. The observed number of words read incorrectly (WRI) is used to measure ORF. Three models are considered for WRI scores, namely the binomial, the zero-inflated binomial, and the beta-binomial. We aim to efficiently estimate passage difficulty, a quantity expressed as a function of the underlying model parameters. Two types of penalty functions are considered for penalized likelihood with respective goals of shrinking parameter estimates closer to zero or closer to one another. A simulation study evaluates the efficacy of the shrinkage estimates using Mean Square Error (MSE) as metric. Big reductions in MSE relative to unpenalized maximum likelihood are observed. The paper concludes with an analysis of the motivating ORF data.
翻译:文件用惩罚性可能性方法在计数数据模型中考虑参数估计。激励数据由多个独立计数变量组成,每个变量的样本大小比较中。数据是在评估学龄儿童口服阅读流畅(ORF)期间收集的。四年级学生的抽样在长度和难度方面有10个可用段落中被给一个样本,这些段落的阅读有不同的长度和难度。读错的字数(WRI)用于测量ORF。三个模型用于进行反统计分数,即二进制、零充气的二进制和乙二进制。我们的目的是有效地估计通过困难,这是基本模型参数的一个函数。两种惩罚功能被认为具有受限制的可能性,其目标分别是缩小参数估计接近于零或接近于1;模拟研究用中位平方错误(MSE)来评估缩小估计值的功效。观察到微小估计值相对于未受惩罚的最大可能性的大幅下降。观测结果与未受惩罚的最大可能性相比。论文最后对激励的ORF数据进行了分析。