Multilevel regression and poststratification (MRP) has become a popular approach for selection bias adjustment in subgroup estimation, with widespread applications from social sciences to public health. We examine the statistical properties of MRP in connection with poststratification and hierarchical models. The success of MRP prominently depends on the availability of auxiliary information strongly related to the outcome. To improve the outcome model fitting performances, we recommend modeling inclusion mechanisms conditional on auxiliary variables and adding a flexible function of estimated inclusion probabilities in the mean structure. We present a framework for statistical data integration and robust inferences of probability and nonprobability surveys, providing solutions to various challenges in practical applications. Our simulation studies indicate the statistical validity of MRP with a tradeoff between bias and variance, and the improvement over alternative methods is mainly on subgroup estimates with small sample sizes. Our development is motivated by the Adolescent Brain Cognitive Development (ABCD) Study that has collected children across 21 U.S. geographic locations for national representation but is subject to selection bias as a nonprobability sample. We apply the methods for population inferences to evaluate cognition performances of diverse groups of children in the ABCD study and demonstrate that the use of auxiliary variables affects the inferential findings.
翻译:多层次回归和后分化(MRP)已成为在分组估算中选择偏见调整的流行方法,从社会科学到公共卫生的广泛应用。我们检查了MRP与职位分配和等级模式有关的统计特性。MRP的成功主要取决于与结果密切相关的辅助信息的可得性。为了改进结果模型的适配性业绩,我们建议以辅助变量为条件的模型化包容机制,并增加一个在平均结构中估计包容概率的灵活功能。我们提出了一个统计数据整合框架,并提出了概率和不概率调查的有力推断,为实际应用中的各种挑战提供了解决办法。我们的模拟研究表明,MRP在统计上的有效性与偏差和差异之间的权衡,替代方法的改进主要取决于小抽样规模的分组估算。我们的发展是青少年脑大脑发育发展(ABCD)研究的动力,该研究收集了21个美国各地的儿童作为国家代表性,但有可能选择偏差作为非概率抽样。我们采用了人口推断方法来评估不同群体在差异和差异之间进行对比的认知性表现。ACD研究显示AB的辅助性研究会影响儿童在ACD的变数中的变化。