Multilevel regression and poststratification (MRP) has become a popular approach for selection bias adjustment in subgroup estimation, with widespread applications from social sciences to public health. We examine the finite population inferential validity of MRP in connection with poststratification and model specification. The success of MRP prominently depends on the availability of auxiliary information strongly related to the outcome. To improve the outcome model fitting performances, we recommend modeling inclusion mechanisms conditional on auxiliary variables and adding flexible functions of estimated inclusion probabilities as predictors in the mean structure. We present a framework for statistical data integration and robust inferences of probability and nonprobability surveys, providing solutions to various challenges in practical applications. Our simulation studies indicate the statistical validity of MRP with a tradeoff between bias and variance, and the improvement over alternative methods is mainly on subgroup estimates with small sample sizes. Our development is motivated by the Adolescent Brain Cognitive Development (ABCD) Study that has collected children's information across 21 U.S. geographic locations for national representation but is subject to selection bias as a nonprobability sample. We apply the methods for population inferences to evaluate the cognition measure of diverse groups of children in the ABCD study and demonstrate that the use of auxiliary variables affects the inferential findings.
翻译:多层次回归和后处理(MRP)已成为在分组估计中选择偏见调整的流行方法,从社会科学到公共卫生的广泛应用。我们检查了MRP在后处理和示范规格方面的有限人口推断有效性。MRP的成功主要取决于能否获得与结果密切相关的辅助信息。为了改进成果模型的适配性业绩,我们建议以辅助变量为条件,建立包容性机制模型,并增加在平均结构中作为预测因素的估计包容概率的灵活功能。我们提出了一个统计数据整合框架,并有力地推断概率和不概率调查,为实际应用中的各种挑战提供了解决办法。我们的模拟研究表明MRP在统计上的有效性,在偏差和差异之间作出权衡,替代方法的改进主要取决于抽样规模小的分组估算。我们的发展受到青少年脑智力发展研究的推动,该研究收集了21个美国地理区域的儿童信息,用于国家代表性,但需选择不易受选取的偏差作为抽样。我们采用的人口预测方法,在预测中,在评估儿童结果的变量中使用方面,从而评估儿童状况的可变性研究。