Multilevel regression and poststratification (MRP) is a popular method for addressing selection bias in subgroup estimation, with broad applications across fields from social sciences to public health. In this paper, we examine the inferential validity of MRP in finite populations, exploring the impact of poststratification and model specification. The success of MRP relies heavily on the availability of auxiliary information that is strongly related to the outcome. To enhance the fitting performance of the outcome model, we recommend modeling the inclusion probabilities conditionally on auxiliary variables and incorporating flexible functions of estimated inclusion probabilities as predictors in the mean structure. We present a statistical data integration framework that offers robust inferences for probability and nonprobability surveys, addressing various challenges in practical applications. Our simulation studies indicate the statistical validity of MRP, which involves a tradeoff between bias and variance, with greater benefits for subgroup estimates with small sample sizes, compared to alternative methods. We have applied our methods to the Adolescent Brain Cognitive Development (ABCD) Study, which collected information on children across 21 geographic locations in the U.S. to provide national representation, but is subject to selection bias as a nonprobability sample. We focus on the cognition measure of diverse groups of children in the ABCD study and show that the use of auxiliary variables affects the findings on cognitive performance.
翻译:多级回归和后分级(MRP)是解决分组估算中选择偏差的流行方法,从社会科学到公共卫生等各个领域都有广泛的应用。在本文件中,我们审查了MRP在社会科学到公共卫生等各个领域的预测偏差。我们审查了MRP在有限人口中的推论有效性,探索了后分制和示范规格的影响。MRP的成功在很大程度上取决于能否获得与结果密切相关的辅助性信息。为了提高结果模型的恰当性能,我们建议以辅助变量为条件,对纳入概率进行建模模型,并纳入将估计包容概率的灵活功能作为平均结构中的预测因素。我们提出了一个统计数据整合框架,为概率和不概率调查提供了有力的推论,解决了实际应用中的各种挑战。我们的模拟研究表明了MRP的统计有效性,这涉及偏差和差异之间的权衡,与替代方法相比,对抽样规模小的分组估算的好处更大。我们运用了我们的方法来进行青少年大脑发育(ABCD)研究,该研究收集了全美21个地理位置儿童的信息,以提供国家代表性和非概率调查。但我们不得不选择儿童不具有选择性的变量的样本。</s>