Small area estimation has become an important tool in official statistics, used to construct estimates of population quantities for domains with small sample sizes. Typical area-level models function as a type of heteroscedastic regression, where the variance for each domain is assumed to be known and plugged in following a design-based estimate. Recent work has considered hierarchical models for the variance, where the design-based estimates are used as an additional data point to model the latent true variance in each domain. These hierarchical models may incorporate covariate information, but can be difficult to sample from in high-dimensional settings. Utilizing recent distribution theory, we explore a class of Bayesian hierarchical models for small area estimation that smooth both the design-based estimate of the mean and the variance. In addition, we develop a class of unit-level models for heteroscedastic Gaussian response data. Importantly, we incorporate both covariate information as well as spatial dependence, while retaining a conjugate model structure that allows for efficient sampling. We illustrate our methodology through an empirical simulation study as well as an application using data from the American Community Survey.
翻译:在官方统计中,小面积估计已成为一个重要的工具,用于为样本规模小的领域建立人口数量估计数。典型的面积级模型作为一种类型的超度回归作用,假定每个领域的差异为已知,并被插入到基于设计的估计中。最近的工作考虑了差异的等级模型,根据设计的估计,以基于设计的估计数作为额外的数据点,用以模拟每个领域的潜在真实差异。这些等级模型可能包含共变信息,但很难从高维环境中进行抽样。我们利用最近的分布理论,探索了一种小面积估算的巴伊西亚等级模型,既可以顺利地根据设计对平均值和差异作出估计。此外,我们还开发了一组单位级模型,用于进行超度测量反应数据。重要的是,我们既纳入共变信息,又纳入空间依赖性,同时保留一个能够有效取样的二次模型结构。我们通过实验性模拟研究以及使用美国社区调查数据的应用来说明我们的方法。